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ft 

ORBDIS BIOMED Forschungs- und 
Entwicklungs GmbH 



Polypeptides and nucleic acids encoding these and their use for the preventi- 
on, diagnosis or treatment of liver disorders and epithelial cancer 



The invention relates to polypeptides, nucleic acids encoding these or a nucleic 
acid according to SEQ ID No. 19 and to their use for the diagnosis, prevention 
and/or treatment of liver disorders and neoplastic disorders especially cancer of the 
liver and other epithelial tissues, benign liver neoplasms such as adenoma and 
other proliferative liver disorders such as focal nodular hyperplasia (FNH) and 
cirrhosis. The invention further relates to methods of diagnosing and treating of 
these disorders. 

The development of cancer in general is characterized by genetic mutations that 
alter activity of important cellular pathways including, for example, proliferation, 
apoptosis (cell death), response to stress and epithelial/stroma interactions. It is 
increasingly recognized that identification of nucleic acids that are deregulated in 
cancer can provide important new insight into the mechanisms of neoplastic trans- 
formation. Identification of deregulated nucleic acid expression in precancerous 
stages, such as macro regenerative nodules and the "large" and "small" cell change 
in liver cancer, provide understanding of early events in malignant transformation. 
Similarly, identification of deregulated gene expression in disorders characterized 
by tissue proliferation and remodeling, such as FNH and cirrhosis in the liver may 
distinguish nucleic acids involved in proliferation and malignant transformation. 
Together such deregulated nucleic acids and the encoded gene products have po- 
tential as new diagnostic markers for cancer. Moreover, the products of these dere- 
gulated nucleic acids per se t are targets for therapeutic intervention in the preventi- 
on and/or treatment of these disorders in human patients. 
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The liver plays a vital role in the metabolism of proteins, lipids, carbohydrates, 
nucleic acids and vitamins. There are numerous disorders effecting the liver that 
cannot be diagnosed, prevented or treated effectively, such as hepatocellular 
carcinoma (HCC). Examination of HCC is particularly well suited for the identifi- 
cation of deregulated gene expression in cancer. This is because tissue samples of 
HCC can be obtained from surgically resected tumors and the tumors are well cir- 
cumscribed solid structures with little stromal tissue. Furthermore, as indicated 
above, there is the possibility for comparative analyses of benign and malignant 
tumors as well as cirrhosis, a non-neoplastic condition. If the limitations in the art 
of identifying differentially expressed genes associated with liver disorders could 
be overcome; this comparative approach may enable identification of deregulated 
nucleic acids specifically involved in the processes of cellular proliferation and 
tissue remodeling in a mature organ (e.g., in cirrhosis) as well as the identification 
and discrimination of gene expression alterations associated with hyperplasia (such 
as FNH) and with benign and malignant neoplasms (e.g., adenoma and HCC). In 
HCC there is an urgent need for new and better diagnostic and therapeutic capabi- 
lities. Deregulated genes in liver cancer may also be highly relevant to other can- 
cers of the gastrointestinal tract and indeed with other carcinomas (epithelial de- 
rived cancers) as these tissues share a common embryological origin. 

On a global basis, hepatocellular carcinoma (HCC) belongs to the most common 
malignant tumors accounting for about 1 million deaths/year. In Southeast Asia the 
incidence rates are about 150/100,000 population/year with clear male predomi- 
nance. In western countries 2 to 4 cases/100,000 population are observed annually 
and the number is still rising. In developing countries, the frequency of HCC cor- 
relates with the hepatitis B-virus carrier state, particularly after vertical transmis- 
sion, affecting younger age groups often in the absence of liver cirrhosis. Else- 
where, HCC is mainly found in older patients with liver cirrhosis most commonly 
associated with chronic hepatitis C-virus infection and/or chronic alcohol abuse 
(Ishak et al, 1999.Atlas of Tumor Pathology. Fascicle 31. Armed Forces Institute of 
Pathology, Washington, DC). 
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Definitive diagnosis of neoplastic liver disorders such as HCC and many other 
tumors relies upon histopathological evaluation of biopsy specimens. This invasive 
surgical procedure is generally not undertaken until symptoms appear and the dis- 
ease is then most often in advanced stages, thereby limiting therapeutic interven- 
tion options. Thus there is a need to improve diagnostics and methods of diagnosis. 
In addition, early diagnosis is crucial but hampered by late onset or even a lack of 
specific clinical symptoms. At diagnosis most HCC tumors are no longer amenable 
to surgical resection (except encapsulated tumors or the fibrolamellar variants) 
(Chen and Jeng, 1997, J. Gastroenterol. Hepatol. 12: 329-34); moreover, they are 
highly resistant to cytostatic therapy (Kawata et al., 2001 Br. J. Cancer 84:886-91). 
Overall, death usually occurs within 1 year after diagnosis. Thus, markers for early 
detection, prognostic indicators, and effective prevention and/or treatment regi- 
mens for HCC are highly desirable in this field. 

In contrast, unlike the well-studied situation in colorectal cancer, liver adenoma 
may not represent a precursor lesion of HCC. Similarly, although cirrhosis and 
hepatitis viral infections are clearly risk factors for HCC, these conditions are not 
prerequisite for the development of HCC. Certain liver lesions may represent HCC 
prestages such as macro regenerative nodular hyperplasia, but this is not yet con- 
firmed (Shortell and Schwartz, 1991, Surg Gynecol Obstet. 173:426-31; Anthony, 
P. in MacSween et al, eds. Pathology of the Liver. 2001, Churchill Livingstone, 
Edinburgh). Although these disorders are diagnosed by histopathological investi- 
gation of liver resections and liver biopsies, no efficient method exists for earlier or 
non-invasive detection of these conditions. Again, there is immediate need for dia- 
gnostic and prognostic markers for these neoplasms and for non-invasive detection 
of these disorders. 

Within the past decade, several technologies have made it possible to monitor the 
expression level of a large number of transcripts within a cell at any one time (see, 
e.g., Schena et al., 1995, Science 270:467-470; Lockhart et al., 1996, Nature Bio- 
technology 14:1675-1680; Blanchard et al., 1996, Nature Biotechnology 14, 1649; 
1996, US 5,569,588). Transcript array technology has been utilized for the identifi- 
cation of genes that are up regulated or down regulated in various disordered states. 



Several recent studies have utilized this technology to examine changes in gene 
expression in human hepatocellular carcinoma. These studies have variously re- 
vealed deregulation (i.e., over- and underexpression) of genes encoding liver speci- 
fic proteins in HCC cell lines and HCC tissues relative to controls. Moreover the 
studies revealed genes essential for cell cycle control, stress response, apoptosis, 
lipid metabolism, cell-cell-interaction, DNA repair and cytokine and growth factor 
production (Graveel et al, 2001, Oncogene 20:2704-12; Kawai et al, 2001, Hepa- 
tology 33:676-91; Lau et al, 2000, Oncol. Res. 12:59-69; Nagai et al, 1998, Cancer 
82:454-61; Okabe et al, 2001, Cancer Res 61:2129-37; Salvucci et al, 1999, Onco- 
gene 18:181-187; Shirota et al, 2001, Hepatology 33:832-40; Tackels-Home et al, 
2001, Cancer 92: 395-405; Wu et al, 2001, Oncogene 20:2674-3682; Xu et al, 
2001, Cancer Res. 61:3176-81). However, there is little concordance in the gene 
expression patterns reported in these studies that may be due to differences in expe- 
rimental design and/or to the heterogeneity of HCC tissue per se. Moreover, the 
etiologies of these HCCs are an important factor. Chronic hepatitis B and C virus 
infections are the major causes of HCC but damage from alcohol and chronic liver 
metabolic disorders are also recognized to result in HCC and the mechanisms re- 
sponsible for development of a tumor from these different etiologies are likely to 
differ* Taken together, until now no satisfactory diagnostics and methods of dia- 
gnosing have been developed in order to be able to intervene in liver disorders. 

The same applies to the therapy of liver disorders, and epithelial cancers. For HCC 
for instance, there is no effective therapeutic option except resection and trans- 
plantation but these approaches are only applicable in early stages of HCC, limited 
by the access to donor livers, and associataed with severe risks for the patient. In 
addition, these approaches are extremely expensive. These cancers respond very 
poorly to chemotherapeutics, most likely due the normal liver function in detoxifi- 
cation and export of harmful compounds. Several other therapeutic options, such as 
chemoembolization, cryotherapy and ethanol injection are still in an experimental 
phase and the efficacy of these is not established. Surgical intervention remains the 
best treatment option but it is not possible to define with precision the extent of the 
tumor. This invasive procedure therefore, is suboptimal from the perspective of 
treatment Furthermore, the lack of early diagnostics for specific liver dysfunctions 
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leads most often to advanced progression of the disease that further confounds the- 
rapeutic options and dramatically increases patient mortality from these diseases 
(Jansen P.L., 1999, Neth. J. Med. 55:287-292). Thus until now no satisfactory the- 
rapies have been developed in order to be able to intervene in liver disorders, and 
other epithelial cancers. Furthermore, in the state of the art, a recognition of the 
different subtypes of liver disorders such as HCC precursor lesions, benign liver 
neoplasms, and metabolic liver diseases such as alcoholic liver disease and cirrho- 
sis, as revealed by differential gene expression, have not been disclosed. 

A summary of the key disease features of some of the disorders evaluated in the 
invention is provided in the following table. , 
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It is, therefore, an object of the present invention to make available diagnostics, 
methods of diagnosis, pharmaceutical compositions and methods of prevention 
and/or treatment of liver disorders, especially of hepatocellular carcinoma (HCC), 
and epithelial cancers, pre-cancerous liver lesions, benign neoplasms such as ade- 
noma, hyperplasias (i.e., the focal nodular variety) and cirrhosis that overcome the 
limitations present in the art. Suprisingly the evaluation of multiple liver disorders 
with overlapping but distinct morphological and clinical features provided new 
information for identification and discrimination and ultimately therapeutic strate- 
gies for disorders of the liven 



The object of the invention is solved by a method of diagnosis of a liver disorder, 
liver cancer and/or epithelial cancer, wherein at least one compound selected from 
the group consisting of a polypeptide according to the sequence of SEQ ID No. 1 to 
SEQ ID No. 9, a functional variant thereof, a nucleic acid encoding the polypepti- 
de, a nucleic acid according to SEQ ID No. 19, a variant of one of the aforemen- 
tioned nucleic acids, an antibody directed against the polypeptide, and a fragment 
of the antibody, is identified in the sample of a patient and compared with at least 
one compound of a reference library or of a reference sample. 

The object of the invention is also solved by a method of treating a patient suffering 
from a liver disorder or an epithelial cancer, wherein at least one component selec- 
ted from the group consisting of a polypeptide according SEQ ID No. 1-9, a func- 
tional variant of the polypeptide, a nucleic acid encoding the polypeptide, a nucleic 
acid encoding the functional variant, a nucleic acid coding for the SEQ ID No. 19, a 
variant of one of the aforementioned nucleic acids, a nucleic acid which is a non- 
functional mutant variant of one of the aforementioned nucleic acids, a nucleic acid 
having a sequence complementary to one of the aforementioned nucleic acids, a 
vector containing one of the aforementioned nucleic acids, a cell containing one of 
the aforementioned nucleic acids, a cell containing the vector, an antibody directed 
against one of the aforementioned polypeptides, an antibody directed against a 
functional variant of one of the aforementioned polypeptides, a fragment of one of 
the aforementioned antibodies, a vector containing a nucleic acid coding for one of 
the aforementioned antibodies, a vector containing a nucleic acid coding for one of 
the aforementioned antibody fragments, a cell containing the vector containing a 
nucleic acid coding for one of the aforementioned antibodies, and a cell containing 
the vector containing a nucleic acid coding for one of the aforementioned antibody 
fragments, is administered to the patient in need of a the treatment in a therapeuti- 
cally effective amount. 

In another aspect of the invention it is provided a pharmaceutical composition 
containing at least one compound selected from the group consisting of a polypep- 
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tide according to the invention, a functional variant thereof, a nucleic acid encoding 
the polypeptide or a nucleic acid according to SEQ ID No. 19, a variant of one of 
the aforementioned nucleic acids,, a.nucleic acid which is a non-functional mutant 
variant of one of the aforementioned nucleic acids, a nucleic acid having a se- 
quence complementary to one of the aforementioned nucleic acids, a vector contai- 
ning one of the aforementioned nucleic acids, a cell containing one of 

A subset of these nucleic acids and polypeptides according to the invention have 
further been shown to be specifically expressed or deregulated in other cancers of 
epithelial origin and preferably not in corresponding normal human tissue(s). These 
nucleic acids preferably include SEQ ID Nos. 11-16, and 19 (Figure 25). Deregu- 
lated nucleic acids in liver cancer, in particular may preferably be highly relevant 
to other cancers of the gastrointestinal tract as these tissues share a common em- 
bryological origin. Consequently, these nucleic acids and the encoded polypeptides 
may preferably be similarly utilized for diagnostics methods of diagnosis, pharma- 
ceutical compositions and methods of prevention and/or treatment of these epithe- 
lial cancers. 

The polypeptides and nucleic acids according to the invention have in common that 
they are differentially expressed in a sample isolated from a patient suffering from 
a disorder according to the invention compared to a reference sample. Preferably 
the regulation of the polypeptides and nucleic acids according to the invention is 
essential for the pathologic process and which are thus in a direct or indirect relati- 
onship with diagnosis, prevention and/or treatment of disorders according to the 
invention. The polypeptides and the nucleic acids according to the invention pre- 
ferably do not belong to the targets known until now such that suprising and com- 
pletely novel approaches for diagnosis and therapy result from this invention. 

Generally, the analysis of differentially expressed genes in tissues is less likely to 
result in errors in the form of artifactual false-positive clones than the analysis of 
cell culture systems. In addition to the fact that existing cell culture systems cannot 
adequately simulate the complexity of pathological processes'in the tissue, the va- 
riations in cell behavior in the culture environment lead to nucleic acid and poly- 
peptide expression patterns with questionable relation to the actual pathologic state. 
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These problems may be less pronounced by an approach that utilizes gene expres- 
sion in normal and diseased human tissue but again multiple variables confound 
clear identification of differential gene expression that is directly relevant to disea- 
. se. For example, differentially expressed nucleic acids may result from inter- 
individual differences, metabolic state and/or clinical treatment paradigm. Further, 
large scale gene expression studies using cDNA microarrays do not indicate the 
cellular source of variation in gene expression. In addition, a differential gene ex- 
pression study including all or most genes produces a very large volume of data 
that confounds identification of key disease-associated gene expression changes. 
Consequently, an approach that includes large scale profiling of gene expression 
from tissue from liver disorders that are defined only generally (as for example, 
'liver tumors') is unlikely to illuminate key genes involved in the disease process 
and it is these key genes that represent best targets for diagnostics and therapeutic 
intervention. 

On account of these difficulties, the success of the screening was significantly de- 
pendent on the choice of the experimental parameters. While the methods used are 
based on established procedures, the screening and verification strategy is already 
inventive per se owing to the elaborate and defined choice of parameters. A unique 
approach employed in this invention has utilized discrete, pathologist-confirmed 
liver cancer pathologies for production of disease specific cDNA libraries enriched 
in nucleic acids specifically up- and down-regulated in HCC compared with a pool 
of non-neoplastic human livers. Non-diseased reference liver samples for the expe- 
riments was also diagnostically confirmed and pooled from 3 independent samples 
to reduce detection of false positives resulting from inter-individual variations. 
Nucleic acids commonly expressed at similar levels in the reference liver pool and 
in diseased liver (i.e., HCC) were removed by the generation of subtractive 
suppressive hybridization (SSH) cDNA libraries (Diatchenko et al., 1996, Proc. 
Natl. Acad. Sci. USA 93:6025-6030). These cDNAs are highly enriched for nucleic 
acids both up- and down-regulated in HCC but do not represent those that are not 
differentially expressed. Each of several thousand SSH clones were amplified by 
the polymerase chain reaction (PCR) and affixed to glass slides in custom cDNA 
microarrays. RNA from additional pathologist-confirmed liver disorders was con- 
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verted to fluorescentiy-labeled cDNA for competitive hybridization with the poo- 
led non-diseased liver RNA on the microarrays. The resulting ratio of hybridization 
intensity revealed nucleic acids specifically deregulated, in liver disorders. In addi- 
tion to providing a pool of candidate cDNAs highly enriched for differentially ex- 
pressed genes, the SSH library represents on a genome-wide scale most if not all 
differentially expressed genes with far fewer clones than in standard cDNA libra- 
ries. This feature thereby focuses on nucleic acids specifically deregulated in disea- 
se. Furthermore, the SSH libraries generated in this invention include cDNA clones 
from nucleic acids that are essentially not expressed in normal liver and thereby not 
represented in conventional cDNA libraries or on genome-scale cDNA micro- 
arrays. 

In a first aspect the invention relates to a polypeptide comprising a sequence accor- 
ding to the SEQ ID No. 2 or a functional variant thereof. The invention also relates 
to a nucleic coding for the polypeptide or a functional variant thereof, in particular 
to the nucleic acid according to the SEQ ED No. 11 and variants thereof. 

The nucleic acids and polypeptides according to the invention can be utilized for 
the diagnosis, prevention and treatment of liver disorders, and epithelial cancers. 

In a preferred embodiment the polypeptide consists of the sequence according to 
the SEQ ED No. 2. hi another preferred embodiment the nucleic acid consists of the 
SEQ ED No. 11. 

Compared to the state of the art, these polypeptides and nucleic acids surprisingly 
allow improved, more sensitive, earlier, faster, and/or non-invasive diagnosis of the 
liver disorders and/or epithelial cancers. 

In another aspect of the invention the invention relates to the use of at least one 
polypeptide according SEQ ID No. 1-9, at least one functional variant of the poly- 
peptide, at least one nucleic acid encoding the polypeptide, at least one nucleic acid 
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encoding the functional variant, at least one nucleic acid coding for the SEQ ID 
No. 19, at least one variant of one of the aforementioned nucleic acids, at least one 
nucleic acid which is. a non-functional mutant variant of one of the. aforementioned 
nucleic acids, at least one nucleic acid having a sequence complementary to one of 
the aforementioned nucleic acids, at least one vector containing one of the afore- 
mentioned nucleic acids, at least one cell containing one of the aforementioned 
nucleic acids, at least one cell containing the vector, at least one antibody directed 
against one of the aforementioned polypeptides, at least one antibody directed 
against a functional variant of one of the aforementioned polypeptides, at least one 
fragment of one of the aforementioned antibodies, at least one vector containing a 
nucleic acid coding for one of the aforementioned antibodies, at least one vector 
containing a nucleic acid coding for one of the aforementioned antibody fragments, 
at least one cell containing the vector containing a nucleic acid coding for one of 
the aforementioned antibodies, and/or at least one cell containing the vector contai- 
ning a nucleic acid coding for one of the aforementioned antibody fragments, for 
the diagnosis, prevention and/or treatment of disorders according to the invention. 
Further embodiments of the invention are described in detail below. 

When compared to the state of the art of therapy of liver disorders, and/or epithelial 
cancers the use of these components surprisingly provide an improved, sustained 
and/or more effective diagnosis, prevention and/or treatment of disorders according 
to the invention. 

The term "polypeptide" refers to the full length of the polypeptide according to the 
invention. In a preferred embodiment the term "polypeptide" also includes isolated 
polypeptides and polypeptides that are prepared by recombinant methods, e.g. by 
isolation and purification from a sample, by screening a library and by protein 
synthesis by conventional methods, all of these methods being generally known to 
the person skilled in the art Preferably, the entire polypeptide or parts thereof can 
be synthesized, for example, with the aid of the conventional synthesis such as the 
Merrifield technique. In another preferred embodiment, parts of the polypeptides 
according to the invention can be utilized to obtain antisera or specific monoclonal 
antibodies, which may be used to screen suitable gene libraries prepared to express 



-li- 
the encoded protein sequences in order to identify further functional variants of the 
polypeptides according to the invention. 

The term "polypeptide according to the invention" refers to the polypeptides accor- 
ding to the SEQ ID No. 1 to SEQ ID No. 9. 

The term "functional variants" of a polypeptide within the meaning of the present 
invention refers to polypeptides which have a sequence homology, in particular a 
sequence identity, of about 70%, preferably about 80%, in particular about 90%, 
especially about 95%, most preferred of 98 % with the polypeptide having the ami- 
no acid sequence according to one of SEQ ID No. 1 to SEQ ID No. 9. Such func- 
tional variants are, for example, the polypeptides homologous to a polypeptide 
according to the invention, which originate ftom organisms other than human, pre- 
ferably from non-human mammals such as, for example mouse, rats, monkeys and 
pigs. Outer examples of functional variants are polypeptides which are encoded by 
different alleles of the gene, in different individuals, in different organs of an orga- 
nism or in different developmental phases. Functional variants, for example, also 
include polypeptides which are encoded by a nucleic acid which is isolated from 
non-liver-tissue, e.g. embryonic tissue, but after expression in a cell involved in 
liver disorders have the designated functions. Functional variants preferably also 
include naturally occurring or synthetic mutations, particularly mutations that 
quantitatively alter the activity of the peptides encoded by these sequences. 
Further, such variants may preferably arise from differential splicing of the enco- 
ding gene. 

"Functional variants" refer to polypeptides that have essentially the same biological 
funtion(s) as the corresponding polypeptide according to the invention. Such biolo- 
gical function can be assayed in a functional assay. 

In order to test whether a candidate polypeptide is a functional variant of a poly- 
peptide according the invention, the candidate polypeptide can be analyzed in a 
functional assay generally known to the person skilled in the art, which assay is 
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suitable to assay the biological function of the corresponding polypeptide accor- 
ding to the invention. Such functional assay comprise for example cell culture sy- 
stems; the generation of mice, in which the genes are deleted ("knocked out") or 
mice that are transgenic for gene encoding the candidate polypeptide; enzymatic 
assays, etc. If the candidate polypeptide demonstrates or directly interferes with 
essentially the same biological function as the corresponding polypeptide accor- 
ding to the invention, the candidate polypeptide is a functional variant of the corre- 
sponding polypeptide, provided that the candidate polypeptide fulfills the require- 
ments on the level of % sequence identity mentioned above. 

Furthermore, the term "functional variant" encompasses polypeptides that are pre- 
ferably differentially expressed in patients suffering from liver disorders, or other 
epithelial cancers relative to a reference sample or a reference library, including 
polypeptides expressed from mutated genes or from genes differentially spliced, 
provided that the candidate functional variant polypeptide fulfills the criteria of a 
functional variant on the level of % sequence identity. Such expression analysis 
can be carried out by methods generally known to the person skilled in the art 

"Functional variants" of the polypeptide can also be parts of the polypeptide ac- 
1 cording to the invention with a length of at least from about 7 to about 1000 amino 
acids, preferably the polypeptide is comprised of at least 10 amino acids, more 
preferably at least 20, most preferred at least 50, for example at least 100, for ex- 
ample at least 200, for example at least 300, for example at least 400, for example 
at least 500, for example at least 600 amino acids provided that they have essen- 
tially the same biological function(s) as the corresponding polypeptide according to 
the invention. Also included are deletions of the polypeptides according to the in- 
vention, in 1he range from about 1-30, preferably from about 1-15, in particular 
from about 1-5 amino acids provided that they have essentially the same biological 
function(s) as the corresponding polypeptide according to the invention. For exam- 
ple, the first amino acid methionine can be absent without the function of the poly- 
peptide being significantly altered. Also, post-translational modifications, for ex- 
ample lipid anchors or phosphoryl groups may be present or absent in variants. 
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lf Sequence identity" refers to the degree of identity (% identity) of two sequences, 
that in the case of polypeptides can be determined by means of for example 
BLASTP 2.0.1 and in the case of nucleic acids by means of for example BLASTN 
2,014, wherein the Filter is set off and BLOSUM is 62 (Altschul et aL, 1997, 
Nucleic Acids Res., 25:3389-3402). 

"Sequence homology" refers to the similarity (% positives) of two polypeptide 
sequences determined by means of for example BLASTP 2.0.1 wherein the Filter is 
set off and BLOSUM is 62 (Altschul et aL, 1997, Nucleic Acids Res., 25:3389- 
3402). 

The term "liver disorder" refers to and comprises all kinds of disorders that pre- 
ferably affect the anatomy, physiology, metabolic, and/or genetic activities of the 
liver, that preferably affect the generation of new liver cells, and/or the regenerati- 
on of the liver, as a whole or parts thereof preferably transiently, temporarily, chro- 
nically or permanently in a pathological way. Preferably also included are inherited 
liver disorders and neoplastic liver disorders. Liver disorder is further understood 
to preferably comprise liver disorders caused by trauma, intoxication, in particular 
by alcohol, drugs or food intoxication, radiation, infection, cholestasis, immune 
reactions, and by inherited metabolic liver diseases. Preferred examples of liver 
liver disorders include cirrhosis, alcoholic liver disease, chronic hepatitis, Wilson's 
Disease, and heamochromatosis. Preferably further included are autoimmune- 
disorders wherein the autoimmune response is directed against at least one poly- 
peptide according to the invention. Within the meaning of the present invention the 
term "liver disorder" preferably also encompasses liver cancer, for example hepa- 
tocellular carcinoma (HCC), benign liver neoplasms such as adenoma and/or FNH. 
Preferably HCC further comprises subtypes of the mentioned disorders, preferably 
including liver cancers characterized by intracellular proteinaceous inclusion bo- 
dies, HCCs characterized by hepatocyte steatosis, and fibrolamellar HCC. For ex- 
ample, precancerous lesions are preferably also included such as those characteri- 
zed by increased hepatocyte cell size (the "large cell" change), and those characte- 
rized by decreased hepatocyte cell size (the "small cell" change) as well as macro 
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regenerative (hyperplastic) nodules (Anthony, P. in MacSween et al, eds. Patholo 
gy of the Liver. 2001, Churchill Livingstone, Edinburgh), 



The term "epithelial cancer" within the meaning of the invention includes adeno- 
carcinomas of any organ other than the liver preferably of the lung, stomach, kid- 
ney, colon, prostate, skin and breast, and refers to disorders of these organs in 
which epithelial cell components of the tissue are transformed resulting in a ma- 
lignant tumor identified according to the standard diagnostic procedures as gene- 
rally known to a person skilled in the art* 

Within the meaning of the invention the term "disorder according to the invention" 
encompasses epithelial cancer and liver disorders as defined above. 

In the case of polypeptides, the term "differential expression of a polypeptide" re- 
fers to the relative level of expression of the polypeptide in an isolated sample from 
a patient compared to the expression of the polypeptide in a reference sample or a 
reference library. The expression can be determined by methods generally known 
to the person skilled in the art Examples of such methods include immunohisto- 
chemical or immunoblot or ELISA detection of the polypeptide with antibodies 
specific for the polypeptide. Detection of the polypeptide through genetic manipu- 
lation to label the polypeptide and detection in a model system is preferably also 
included such as by tagging the polypeptide in a transgene for expression in a mo- 
del system. 

Hie -term "sample" refers to a biomaterial comprising liver tissue or liver cells, 
preferably tissue from another organ subject to malignant transformation or a cell 
from this organ, blood, serum, plasma, ascitic fluid, pleural effusions, cerebral spi- 
nal fluid, saliva, urine, semen, feces. 

The sample can be isolated from a patient or another subject by means of methods 
including invasive or non-invasive methods. Invasive methods are generally known 
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to the skilled artisan and comprise for example isolation of the sample by means of 
puncturing, surgical removal of the sample from the opened body or by means of 
endoscopic instruments. Minimally invasive and non-invasive methods are also 
known to the person skilled in the art and include for example, collecting body 
fluids such as blood, serum, plasma, ascitic, pleural and cerebral spinal fluid, sali- 
va, urine, semen, and feces. Preferably the non-invasive methods do not require to 
penetrate or open the body of a patient or subject through openings other than the 
body openings naturally present such as the mouth, ear, nose, rectum, urethra, and 
open wounds. 

The term "minimally invasive" procedure refers to methods generally known, espe- 
cially by persons skilled in the art, for obtaining patient sample material that do 
preferably not require anesthesia, can be routinely accomplished in a physician 
office or clinic and are either not painful or only nominally painful. The most 
common example of a minimally invasive procedure is venupuncture. 

The term "reference sample" refers to a sample that serves as an appropriate control 
to evaluate the differential expression of a nucleic acid and/or a polypeptide accor- 
ding to the invention in a given sample isolated from a patient; the choice of such 
appropriate reference sample is generally known to the person skilled in the art 
Examples of reference samples include samples isolated from a non-diseased organ 
or tissue or cell(s) of the same patient or from another subject, wherein the 
non-diseased organ or tissue or cell(s) is selected from the group consisting of liver 
tissue or liver cells, blood, or the samples described above. For comparison to ex- 
pression in the sample isolated from a patient with the liver disorder, the reference 
sample may also include a sample isolated from a non-diseased organ or tissue or 
cell(s) of a different patient, wherein the liver disordered- tissue or cell(s) is selec- 
ted from the sample group listed above. Moreover the reference may include 
samples from healthy donors, preferably matched to the age and sex of the patient. 

The term "reference library" refers to a library of clones representing expressed 
genes, which, library is preferably prepared from non-diseased liver tissue or cells. 
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The reference may also derive from mRNA from non-diseased liver tissue or cells 
and may also comprise a data base containing data on non-diseased tissue expressi- 
on of nucleic acids. For comparison of the expression of the nucleic acids or poly- 
peptides according to the invention, a sample isolated from a patient with the di- 
sordered liver, the reference library may comprise an expression library prepared 
from liver disorder-diseased liver tissue or cells and a data base containing data on 
liver disorder-specific expression of nucleic acids. 

The term "patient" within the meaning of the invention includes animals, pre- 
ferably mammals, and humans, dead or alive. The patient is either suffering from a 
liver disorder, and/or an other epithelial cancer, subject to analysis, preventive 
measures, therapy and/or diagnosis in the context of liver disorder and/or an other 
epithelial cancer. 

The term "subject" within the meaning of the invention includes animals, pre- 
ferably mammals, and humans, dead or alive that are not suffering from a liver 
disorders and/or an other epithelial cancer and thus represent a preferred appro- 
priate control for the determination of differential expression of nucleic acids 
and/or polypeptides according to the invention in a patient 

The term "effective treatment" within the meaning of the invention refers to a 
treatment that preferably cures the patient from at least one disorder according to 
the invention and/or that improves the pathological condition of the patient with 
respect to at least one symptom associated with the disorder, preferably 3 sym- 
ptoms, more preferably 5 symptoms, most preferably 10 symptoms associated with 
the disorder; preferably on a transient, short-term (in the order of hours to days), 
long-term (in the order of weeks, months or years) or permanent basis, wherein the 
improvement of the pathological condition may be preferably constant, increasing, 
decreasing, continuously changing or oscillatory in magnitude as long as the over- 
all effect is a significant improvement of the symptoms compared with a control 
patient Therapeutic efficacy and toxicity, e.g. ED 5 o and LD50 may be determined 
by standard pharmacological procedures in cell cultures or experimental animals. 
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The dose ratio between therapeutic and toxic effects is the therapeutic index and 
may be expressed by the ratio LD50/ED50. Pharmaceutical compositions which ex- 
hibit large therapeutic indexes are preferred, the dose must be adjusted to the age, 
weight and condition of the individual patient to be treated, as well as the route of 
administration, dosage form and regimen, and the result desired, and the exact do- 
sage should of course be determined by the practitioner. 

The actual dosage depend on the nature and severity of the disorder being treated, 
and is within the discretion of the physician, and may be varied by titration of the 
dosage to the particular circumstances of this invention to produce the desired the- 
rapeutic effect However, it is presently contemplated, that pharmaceutical compo- 
sitions containing of from about 0.1 to 500 mg of the active ingredient per indivi- 
dual dose, preferably of from about 1 to 100 mg, most preferred from about 1 to 10 
mg, are suitable for therapeutic treatments. 

The active ingredient may be administered in one or several dosages per day. A 
satisfactory result can, in certain instances, be obtained at a dosage as low as 0.1 
ug/kg intravenously (i.v.) and 1 ug peroral (p.o.). Preferred ranges are from 0.1 
ug/kg/day to about 10 mg/kg/day i.v., and from 1 ug/kg/day to about 100 
mg/kg/day p.o. 

In another aspect the invention relates to a fusion protein containing a polypeptide 
according to the SEQ ID No. 1 to 9 or a functional variant thereof. 

A "fusion protein" refers to a polypeptide containing at least one polypeptide ac- 
cording to the SEQ ED No. 1 to 9, a functional variant or part thereof and at least 
one component A selected from polypeptide, peptide and/or peptide analogue that 
is linked to the polypeptide according to the invention by means of covalent or 
non-covalent binding such as e.g. hydrogen bonds, generally known to the person 
skilled in the art. Preferred examples of component A for fusion proteins are poly- 
peptide, peptide and/or peptide analogues, that facilitate easier detection of the 
fusion proteins, these are, for example, "green-fluorescent-protein" or variants the- 
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reof. Also included are fusion proteins that facilitate purification of the recombi- 
nant protein such as "his-tags", or fusions that increase the immunogenicity of the 
protein. * .......... . . 

Fusion proteins according to the invention can be produced by methods generally 
known to the person skilled in the art. The fusion proteins according to the inventi- 
on can be used for the diagnosis, prevention and or treatment of liver disorders 
and/or epithelial cancer. 

Compared to the state of the art, these fusion proteins surprisingly allow improved, 
more sensitive, earlier, faster, and/or non-invasive diagnosis and/or improved, su- 
stained and/or more effective treatment of the liver disorders and/or epithelial can- 
cers. 

In a further embodiment of the invention a nucleic acid encoding a polypeptide 
comprising a polypeptide according to the invention or a functional variant thereof 
or a nucleic acid according to SEQ ID No. 19, or a variant thereof is provided. Pre- 
ferred nucleic acids according to the invention have a sequence according to one of 
SEQ ID No. 10 to SEC ID No, 18 or a nucleic acid according to SEQ ID No. 19, or 
a variant thereof. In particular the invention relates to nucleic acids according to the 
invention that have been isolated. 

Compared to the state of the art, these nucleic acids and polypeptides surprisingly 
allow improved, more sensitive, earlier, fester, and/or non-invasive diagnosis 
and/or improved, sustained- and/or more effective treatment of the liver disorders 
and/or epithelial cancers. 

The term "nucleic acid according to the invention" refers to the nucleic acids ac- 
cording to the SEQ ID No. 10 to SEQ ID No. 19. 
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The term "encoding nucleic acid" relates to a DNA sequence which codes for an 
isolatable bioactive polypeptide according to the invention or a precursor thereof 
The polypeptide can be encoded by a sequence of full length or any part of the 
coding sequence as long as the biological function, such as for example receptor- 
activity, is essentially retained (cf. definition of functional variant). 

It is known that small alterations in the sequence of the nucleic acids described 
above can be present, for example, due to the degeneration of the genetic code, or 
that untranslated sequences can be attached to the 5' and/or 3 f end of the nucleic 
acid without significantly affecting the activity of the encoded polypeptide. This 
invention, therefore, also comprises so-called naturally occurring and artificially 
generated "variants" of the nucleic acids described above. 

Preferably, the nucleic acids used according to the invention are DNA or RNA, 
preferably a DNA, in particular a double-stranded DNA. In particular the nucleic 
acid according to the invention may be an RNA molecule, preferably single- 
stranded or a double-stranded RNA molecule. The sequence of the nucleic acids 
may further comprise at least one intron and/or one polyA sequence. 

Nucleic acids according to the invention can be produced by methods generally 
known to the skilled artisan and have also been described in detail below. 

"Variant" within the meaning of the invention refers to all DNA sequences which 
are complementary to a DNA sequence, which hybridize with the reference se- 
quence under stringent conditions and have a similar activity to the corresponding 
polypeptide according to the invention. The nucleic acids according to the inventi- 
on can also be used in the form of their antisense sequence. 

"Variant" of the nucleic acids can also be parts of the nucleic acid according to the 
present invention with at least about 8 nucleotides length, preferably with at least 
about 16 nucleotides length, in particular with at least about 21 nucleotides length, 
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more preferably with at least about 30 nucleotides length, even more preferably 
with at least about 40 nucleotides length, most preferably with at least about 50 
nucleotides length as long as the parts have a similar activity to the corresponding 
polypeptide according to the invention. Such activity can be assayed using the 
functional assays described further above. 

In a preferred embodiment of the invention the nucleic acid comprises a nucleic 
acid having a sequence complementary to a nucleic acid according to the invention 
or a variant thereof Preferably the nucleic acid comprises a non-functional mutant 
variant of the nucleic acid according to the invention or a variant thereof. 

In particular the invention relates to a nucleic acid having a complementary se- 
quence wherein the nucleic acid is an antisense molecule or an UNA interference 
molecule. 

The term "non-functional mutant variant of a nucleic acid" refers to a nucleic acid 
derived from a nucleic acid according to the invention or a variant thereof having 
been mutated such that the polypeptide encoded by the non-functional mutant vari- 
ant of the nucleic acid exhibits a biological activity which in comparison the non- 
mutated polypeptide is significantly decreased or abolished. Such activity of the 
polypeptide encoded by the non-functional mutant variant nucleic acid can be de- 
termined by means of a functional assay as described above for the evaluation of 
functional variants. The construction and screening of such non-functional mutant 
variant derived from a nucleic acid according to the invention are generally known 
to the person skilled in the art. Such "non-functional mutant variant of a nucleic 
acid" according to the invention can be expressed in a patient and will preferably 
abolish or diminish the level of expression of the targeted nucleic acid by compe- 
ting with the native mRNA molecules for translation into polypeptides by the ribo- 
somes. 
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"Stringent hybridization conditions" refer to those conditions in which hybridizati- 
on takes place at 60°C in 2.5 x SSC buffer and remains stable following a number 
of washing steps at 37°C in a buffer of lower salt concentration. 

The term "differential expression of a nucleic acid" refers to the relative level of 
expression of the nucleic acid in an isolated sample from a patient compared to the 
expression of the nucleic acid in a reference sample or a referencelibrary. Definiti- 
ons of reference samples and reference libraries have been described in detail abo- 
ve. The expression can be determined by methods generally known to the person 
skilled in the art. Examples of such methods include RNA blot (northern) analysis, 
nuclease protection, in situ hybridization* reverse transcriptase PCR (RT-PCR4 
including quantitative kinetic RT-PCR). cDNA and oligonucleotide microarrays 
are also included as such methods. 

In a preferred embodiment the nucleic according to the invention is the OBcll 
cDNA (SEQ ID 10), which was assembled by identification of overlapping se- 
quences from the non-redundant and human EST GenBank sequence databases. 
The expression in HCC of RNA corresponding to each assembled sequence was 
confirmed experimentally. The initial sequence upregulated in HCC relative to 
non-diseased liver identified as an SSH cDNA clone corresponded to GenBank 
sequence AL050205. The 5* end of that sequence overlaps with AF131755; this 
sequence was extended progressively 5' with XM1 13703, AK055521 and 
AY004310. The latter three sequences include the open reading frame encoding 
OBcll ,pr (SEQ ED 1). In support of this mRNA construct, all overlapping cDNA 
sequences can be localized to the same chromosome. Furthermore, an mRNA of 
approximately 6 kilobases was identified by RNA blot analysis of HCC but not 
normal liver RNA using the SSH sequence from this clone as a hybridization probe 
(Figure 27). Expression of sequences corresponding to this clone have not pre- 
viously been reported in liver or in HCC. 

In a preferred embodiment the polypeptide according to the invention is the 
OBcll. pr polypeptide sequence (SEQ ID 1) which was surprisingly identified from 
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an mRNA identified to be upregulated in HCC by an average of 2.9-fold relative to 
non-diseased liver (OBcll, SEQ ED 10). cDNA sequences encoding this polypepti- 
de and overlapping with this mRNA were identified with reverse transcriptase PCR 
analysis and these nucleic acids are similarly elevated in HCC. This polypeptide 
sequence was previously unrecognized with respect to elevated levels in HCC. 
From the sequence of the OBcll. pr polypeptide, two conserved sequence domains 
can be identified with the conserved domain prediction CDD algorithm available 
with the BLAST sequence analysis tools (Altschul et aL, 1997, Nucleic Acids Res., 
25:3389-3402); a lupus La polypeptide type RNA binding domain (SEQ ID No. 1, 
amino acids 47 to 125), and a GTPase enzymatic domain with unknown function 
(SEQ ID No/ 1, amino acids 90 to 203). The OBcll.pr sequence has been designa- 
ted in the GenBank sequence database as the cellular myeloproliferative leukemia 
receptor (c-Mpl) binding polypeptide. Although a potential modulator of the mye- 
loproliferative leukemia virus receptor (also known as the thrombopoietin recep- 
tor), the functional role for this polypeptide has not been described in any system. 
Similarly, the expression pattern of this polypeptide has not been disclosed. The 
mRNA encoding this polypeptide is elevated more than 2-fold relative to non- 
diseased liver in 1 1 of 21 liver tumors subjected to expression profiling (52%). The 
mRNA encoding this polypeptide is similarly elevated at least 2-fold in 4 of 4 focal 
nodular hyperplasia (FNHs) profiled (100%) (Figure 21). For this and the other 
nucleic acids according to the invention, this value for expression includes the ex- 
pression value ratio data from all of the 21 HCC samples subjected to the cDNA 
microarray expression profiling experiments, including the values from samples 
that were not elevated by 2-fold or greater. 

The expression of this mRNA is remarkably specific to liver disorders since ex- 
pression is not detected in other carcinomas analyzed nor in non-diseased tissues 
including liver, kidney, stomach, lung, skin and others (see Figure 25). Therefore it 
was surprisingly found that there is a strong and specific correlation between the 
expression of OBcll.pr polypeptide (SEQ ID No.l) and the nucleic acid encoding 
the polypeptide (SEQ ID No. 10) respectively and the disorders according to the 
invention. Therefore the polypeptide and the encoding nucleic acid can be utilized 
for diagnosis of disorders according to the invention, for example for the diagnostic 
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discrimination between hyperplastic (including neoplastic) liver diseases and cir- 
rhosis. Furthermore these results demonstrate that OBcll.pr polypeptide (SEQ ID 
No.l) and the nucleic acid encoding the polypeptide (SEQ ID No- 10) can be em- 
ployed in the prevention and therapy of disorders according to the invention, in 
particular for the treatment of hyperplastic (including neoplastic) liver diseases. 
With regard to the treatment it is preferred to carry out the treatment such that the 
expression of the OBcll.pr polypeptide or of the nucleic acid encoding the poly- 
peptide is reduced and/or inhibited, for example by administering antisense oligo- 
nucleotides or RNA interference molecules that specifically interact with the 
nucleic acid encoding the OBcll.pr polypeptide. Alternatively the treatment may ■ 
be carried out such that the activity of the OBcll.pr polypeptide is reduced and/or 
inhibited, for example by administering an antibody directed against the OBcll.pr 
polypeptide or an antibody fragment thereof which block the activity of the 
OBcll .pr polypeptide to a patient in need of such treatment. Compared to the state 
of the art, this OBcll.pr polypeptide and/or OBcll nucleic acid surprisingly allow 
improved, more sensitive, earlier, faster, and/or non-invasive diagnosis and/or im- 
proved, sustained and/or more effective treatment of the liver disorders and/or epit- 
helial cancers. 

In a preferred embodiment the nucleic acid according to the invention is the OBclS 
(SEQ ID 1 1) which is the compiled sequence encoding OBclSpr. (SEQ ID 2). The 
discrete nucleic acid sequence as well as the deduced polypeptide sequence are 
novel. The entire sequence was established from a number of GenBank expressed 
sequence tag (EST) database sequences and GenBank genomic database sequences 
and each segment was verified for overexpression in HCC. For example, the se- 
quence for this nucleic acid on a cDNA microarray is elevated an average of 24.7- 
fold relative to non-diseased liver reference (Figure 21). Expression of partial se- 
quences corresponding to this clone has been reported in several tissues and some 
tumors (including fetal liver and colon adenocarcinoma) but the entire sequence 
has not previously been implicated in HCC. 

Information concerning expression of this and all sequences according to the in- 
vention is obtained from searching of public domain databases (such as the Pub- 
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Med and SOURCE). Journal articles have not been published for most of the se- 
quences according to the invention. The relative abundance of cDNA clones from 
automatically sequenced cDNA libraries therefore provides the evidence cited her- 
ein for expression of this and other sequences according to the invention. This in- 
formation is accessed via databases such as 'SOURCE' (provided by the Genetics 
Department, Stanford University) that includes data curated from UniGene, Swiss- 
Prot, GeneMap99, RHdb, dbEST, GeneCards and LocusLink databases. 

In a preferred embodiment the polypeptide according to the invention is the 
OBclS.pr polypeptide (SEQ ID 2) which represents the largest open reading frame 
from this deregulated mRNA sequence. This polypeptide sequence does not con- 
tain recognized sequence homologies to characterized polypeptides or to known 
structural motifs. No pattern of expression has been described for this polypeptide. 
Expression of the mRNA encoding this polypeptide is elevated greater than 2 fold 
in 100% of HCC cases examined relative to non-diseased liver and greater than 8- 
fold in 17 of the 21 cases profiled (81%). Elevated expression of the encoding 
mRNA relative to non-diseased liver is also evident in liver adenoma, FNH, and 
cirrhotic livers but the transcript is less dramatically upregulated in cirrhosis. The 
mRNA encoding this polypeptide is expressed in non-diseased human lung, brain 
(cortex), colon, testis tissue but not in most other carcinomas evaluated. Overex- 
pression of this polypeptide and/or the encoding mRNA, therefore may be useful 
for diagnosis of liver disorders. These results clearly demonstrate that the OBclS.pr 
polypeptide (SEQ ID 2) and the nucleic acid encoding the polypeptide (SEQ ID 
1 1) can be utilized for diagnosis, prevention and treatment of disorders according 
to the invention, in particular for HCC, liver adenoma, FNH and cirrhosis. With 
regard to the treatment it is preferred to carry out the treatment such that the ex- 
pression of the OBcl5.pr polypeptide or of the nucleic acid encoding the polypepti- 
de is reduced and/or inhibited, for example by administering antisense oligo- 
nucleotides or RNA interference molecules that specifically interact with the 
nucleic acid encoding the OBcl5.pr polypeptide. Alternatively the treatment may 
be carried out such that the activity of the OBclS.pr polypeptide is reduced and/or 
inhibited, for example by administering an antibody directed against the OBclS.pr 
polypeptide or an antibody fragment thereof which block the activity of the 
OBclS.pr polypeptide to a patient in need of such treatment. Compared to the state 
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of the art, this OBclS.pr polypeptide and/or OBclS nucleic acid surprisingly allow 
improved, more sensitive, earlier, faster, and/or non-invasive diagnosis and/or im- 
proved, sustained and/or more effective treatment- of the liver disorders and/or 
other epithelial cancers. 

In a preferred embodiment the nucleic acid according to the invention is the IK2 
(SEQ ID 12) is represented by the Gene Bank sequence NM_025160 which inclu- 
des the open reading frame encoding IK2.pr polypeptide (SEQ ID 3). The IK2.pr 
polypeptide is another embodiment of the invention, EST sequences corresponding 
to this clone have been reported in cDN A libraries from several tissues including 
liver and in adenocarcinomas, but the sequence has not previously been implicated 
in HCC. Expression of this polypeptide has not been described in any cell or tis- 
sue. The polypeptide sequence has no known function although the sequence is 
evolutionarily well conserved (predicted polypeptides are found in several mam- 
mals, fruit fly (Drosophila) and plants (Arabidopsis). The CDD algorithm predicts 
several WD40-type polypeptide-polypeptide interaction domains in this polypepti- 
de sequence according to the invention. In liver samples from HCC patients ex- 
pression of the mRNA encoding this polypeptide is surprisingly elevated relative 
to non-diseased liver by an average value of 4.67-fold in 15 of the 21 cases profi- 
led (71%). Elevated expression of the encoding mRNA relative to non-diseased 
liver is also evident in cirrhotic livers. Highest differential expression levels of the 
mRNA encoding this peptide relative to non-diseased liver were observed in FNH; 
8-fold upregulation in 4 of 4 cases profiled. The mRNA encoding this polypeptide 
is also expressed in several other human carcinomas including those of the 
mammary gland, lung and kidney, and in 2 (breast and kidney) of the 17 non- 
diseased human tissues examined. These results demonstrate that the overexpressi- 
on of this polypeptide and/or the encoding mRNA, can be utilized for the diagno- 
sis, prevention and treatment of disorders according to the invention, in particular 
for the diagnosis of HCC, FNH, cirrhosis, and epithelia-derived neoplasms. With 
regard to the treatment it is preferred to carry out the treatment such that the ex- 
pression of the IK2.pr polypeptide or of the nucleic acid encoding the polypeptide 
is reduced and/or inhibited, for example by administering antisense oligonucleoti- 
des or RNA interference molecules that specifically interact with the nucleic acid 
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encoding the IK2.pr polypeptide. Alternatively the treatment may be carried out 
such that the activity of the IK2.pr polypeptide is reduced and/or inhibited, for ex- 
ample by administering an antibody directed against the IK2.pr polypeptide or an 
antibody fragment thereof which block the activity of the IK2.pr polypeptide to a 
patient in need of such treatment Compared to the state of the art, this DC2.pr po- 
lypeptides and/or IK2 nucleic acid surprisingly allow improved, more sensitive, 
earlier, faster, and/or non-invasive diagnosis and/or improved, sustained and/or 
more effective treatment of the liver disorders and/or other epithelial cancers. 

In a preferred embodiment the nucleic acid according to the invention is. the IKS 
cDNA sequence (SEQ ID 13) which represents the sequence of an HCC deregula- 
ted cDNA clone. Expression of sequences corresponding to this clone has been 
reported in several tissues (including liver) and some tumors (including pituitary 
and prostate) but the sequence has not previously been described to be upregulated 
in HCC. hi a preferred embodiment the polypeptide according to the invention is 
the IKS.pr polypeptide (SEQ ID 4) which is encoded by the IKS cDNA sequence 
(SEQ ID 13). The polypeptide sequence is deduced from the GenBank database 
(Accession number: NMJ)06407) as JWA, a vitamin A responsive polypeptide. 
Although the gene encoding this putative polypeptide has been described from 
stimulation of cultured cells with vitamin A, the presence of the polypeptide has 
not been described in any cell or tissue and the function is unknown. JWA is 
further described as a cytoskeleton-associated polypeptide in the GenBank databa- 
se. The polypeptide shares homology also with rodent polypeptides that interact 
specifically with and may reduce the activity of the EAAC1 glutamate transporter. 
A conserved domain search of this sequence indicates the likely presence of a pre- 
nylated rab acceptor 1 domain (PRA1), possibly mediating interaction with G pro- 
tein signaling molecules. Expression of the mRNA encoding this polypeptide is 
elevated by an average of 9.14-fold relative to non-diseased liver in 100% of the 
HCC cases profiled. Similarly, elevated expression of the encoding mRNA is also 
evident in Adenoma and FNH. The encoding mRNA expression is differentially 
expressed also in cirrhotic livers but to a lesser extent than in the other liver disor- 
ders. The mRNA encoding this polypeptide is expressed in lung, kidney and colon 
human carcinomas but in just 1 of the 17 non-diseased human tissues examined. 
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Overexpression of this polypeptide and/or the encoding mRNA, may mark specific 
epithelia-derived neoplasms, including liver cancer. These results show that the 
differential upregulated expression of the IKS cDNA sequence is highly specific 
for disorders according to the invention. Therefore the DC5.pr polypeptide, and/or 
the encoding nucleic acid can he utilized for the diagnosis, prevention and treat- 
ment of disorders according to the invention, in particular for the diagnosis of 
HCC, Adenoma, FNH, cirrhosis, and epithelia-derived neoplasms. With regard to 
the treatment it is preferred to carry out the treatment such that the expression of 
the IK5.pr polypeptide or of the nucleic acid encoding the polypeptide is reduced 
and/or inhibited, for example by administering antisense oligonucleotides or RNA 
interference molecules that specifically interact with the nucleic acid encoding the 
IK5.pr polypeptide. Alternatively the treatment may be carried out such that the 
activity of the IKS.pr polypeptide is reduced and/or inhibited, for example by ad- 
ministering an antibody directed against the IKS.pr polypeptide or an antibody 
fragment thereof which block the activity of the IKS.pr polypeptide to a patient in 
need of such treatment. Compared to the state of the art, this IKS.pr polypeptide 
and/or IKS nucleic acid surprisingly allow improved, more sensitive, earlier, faster, 
and/or non-invasive diagnosis and/or improved, sustained and/or more effective 
treatment of the liver disorders and/or other epithelial cancers. 

In a preferred embodiment the nucleic acid according to the invention is the SEQ 
ID 14 which has been disclosed before (Accession, no. X83544) encoding the 
DAP3.pr polypeptide (SEQ ID 5). The invention further relates to the death asso- 
ciated polypeptide 3 (DAP3, SEQ ID 5) which has been implicated in promotion of 
apoptotic cell death when overexpressed in cultured cells (Kissil et al., 1995, J. 
Biol. Chem., 270:27932-6). This polypeptide has not been implicated, however, in 
disorders according to the invention, especially in liver disorders or HCC. The po- 
lypeptide contributes to the mitochondrial 28S ribosomal complex. As such, this 
polypeptide is likely to be ubiquitously expressed in many if not all tissues and 
cells, albeit apparently at relatively low levels. No specific function for endoge- 
nous DAP3 has not been described (Cadvar Koc et al., 2001, FEBS Lett., 492:166- 
170). Expression of the mRNA encoding this polypeptide is elevated an average of 
5.5-fold relative to non-diseased liver in 18 of the 21 HCC cases profiled (86%). 
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Elevated expression of the encoding mRNA is also evident in other liver disorders 
but to a lesser extent than in HCC. These results show that the strongly upregulated 
expression of the DAP3 cDNA sequence is highly specific for disorders according 
to the invention, especially in HCC. Therefore the DAP3 polypeptide and/or the 
encoding nucleic acid can be utilized for the diagnosis, prevention and treatment of 
disorders according to the invention, in particular for die diagnosis of HCC. With 
regard to the treatment it is preferred to carry out the treatment such that the ex- 
pression of the DAP3 polypeptide or of the nucleic acid encoding the polypeptide 
is reduced and/or inhibited, for example by adrninistering antisense oligonucleoti- 
des or RNA interference molecules that specifically interact with the nucleic acid 
encoding the DAP3 polypeptide: Alternatively the treatment may be carried out 
such that the activity of the DAP3 polypeptide is reduced and/or inhibited, for ex- 
ample by adrninistering an antibody directed against the DAP3 polypeptide or an 
antibody fragment thereof which block the activity of the DAP3 polypeptide to a 
patient in need of such treatment Compared to the state of the art, this DAP3 poly- 
peptide and DAP3 nucleic acid surprisingly allow improved, more sensitive, ear- 
lier, faster, and/or non-invasive diagnosis and/or improved, sustained and/or more 
effective treatment of the liver disorders and/or other epithelial cancers. 

In another preferred embodiment invention relates to the HCC up-regulated the . 
LOC5.pr hypothetical polypeptide (SEQ ID 6) and to the to the nucleic acid LOC5 
(SEQ ID 15) coding for the polypeptide. cDNA corresponding to this mRNA has 
been identified in cDNA libraries from several human tissues including liver (in- 
formation from SOURCE database as described above) but the sequence has not 
previously been reported to be up-regulated in disorders according to the invention, 
in particular in HCC. Expression of this mRNA is elevated 5-fold relative to non- 
diseased liver in 71% of the HCC cases profiled. Similar analysis reveals elevated 
expression of this mRNA in FNH and in a majority of cirrhotic livers subjected to 
this cDNA microarray expression profiling procedure. The mRNA is expressed in 
other human gastrointestinal tract carcinomas but only in brain and bone marrow 
of the 17 non-diseased human tissues examined. LOC5.pr (SEQ ID 6) is a predic- 
ted 30 kDa polypeptide (Accession number NP_060917.1 in the GenBank databa- 
se). The presence of this polypeptide has not been described in any cell or tissue. 
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No function has been described for this predicted polypeptide and no conserved 
domains are revealed from a search with the CDD domain algorithm. These results 
show that the strongly upregulated expression of the LOC5 cDNA sequence is hig- 
hly specific for disorders according to the invention, especially in HCC, FNH and 
in a majority of cirrhotic livers. Therefore the LOCS.pr polypeptide and/or the en- 
coding nucleic acid can be utilized for the diagnosis, prevention and treatment of 
disorders according to the invention, in particular for the diagnosis of in HCC, 
FNH, and a majority of cirrhotic livers. With regard to the treatment it is preferred 
to carry out the treatment such that the expression of the LOCS.pr polypeptide or 
of the nucleic acid encoding the polypeptide is reduced and/or inhibited, for ex- 
ample by administering antisense oligonucleotides or RNA interference molecules 
that specifically interact with the nucleic acid encoding the LOCS.pr polypeptide. 
Alternatively the treatment may be carried out such that the activity of the 
LOCS.pr polypeptide is reduced and/or inhibited, for example by administering an 
antibody directed against the LOCS.pr polypeptide or an antibody fragment thereof 
which block the activity of the LOC5.pr polypeptide to a patient in need of such 
treatment. Compared to the state of the art, this LOCS.pr polypeptide and/or LOC5 
nucleic acid surprisingly allow improved, more sensitive, earlier, faster, and/or 
non-invasive diagnosis and/or improved, sustained and/or more effective treatment 
of the liver disorders and/or other epithelial cancers. 

In a further preferred embodiment the invention relates to the SEC14L2 cDNA 
(SEQ ID 16) encoding the SEC14L2.pr polypeptide (SEQ ID No. 7) according to 
the invention. The expression of SEC14L2 mRNA, has been described in many 
tissues but elevation of this message or the encoded polypeptide has not been pre- 
viously reported in disorders according to the invention in particular not in liver 
disorders or cancer. SEC14L2.pr (SEQ ID 7) is a human homologue of the yeast 
sec polypeptide 14. Although implicated in the yeast secretory pathway, a clear 
function for this polypeptide or its homologues has not been described in any spe- 
cies. This human sequence has also been suggested to bind to tocopherol and it has 
been predicted that this polypeptide is involved in squalene transfer, cholesterol 
biosynthesis or more generally in intracellular transport (Zimmer et at, 2000, J. 
Biol. Chem. 275:25672-25680). Expression of this polypeptide sequence has not 
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been reported in human cells or tissues. The polypeptide sequence includes possi- 
ble G-polypeptide binding and phosphotidylinositol transfer domains and a con- 
sensus CRALJTRIO domain. The latterhas.been implicated in vitamin binding via 
the cis-retinal CRAL motif. The mRNA encoding this polypeptide is elevated and 
average of 5.14-fold or greater relative to non-diseased liver in 71% of HCC 
samples, in all FNH disease samples profiled, but not in adenoma in only one-half 
of cirrhosis samples. Expression of the mRNA encoding this polypeptide has been 
detected in kidney and colon carcinoma and in the normal pancreas but not in other 
normal tissues examined (Figure 25). These results show that the strongly upregu- 
lated expression of the SEC14L2 cDNA sequence is highly specific for disorders 
according to the invention, especially in HCC and FNH. Therefore the 
SEC14L2.pr polypeptide and/or the encoding nucleic acid can be utilized for the 
diagnosis, prevention and treatment of disorders according to the invention, in par- 
ticular for the diagnosis of HCC, FNH and preferably also in cirrhosis. With regard 
to the treatment it is preferred to carry out the treatment such that the expression of 
the SEC14L2.pr polypeptide or of the nucleic acid encoding the polypeptide is 
reduced and/or inhibited, for example by administering antisense oligonucleotides 
or RNA interference molecules that specifically interact with the nucleic acid en- 
coding the SEC14L2.pr polypeptide. Alternatively the treatment may be carried 
out such that the activity of the SEC14L2.pr polypeptide is reduced and/or inhibi- 
ted> for example by administering an antibody directed against the SEC14L2.pr 
polypeptide or an antibody fragment thereof which block the activity of the 
SEC14L2.pr polypeptide to a patient in need of such treatment Compared to the 
state of the art, this SEC14L2.pr polypeptide and/or SEC14L2 nucleic acid surpri- 
singly allow improved, more sensitive, earlier, faster, and/or non-invasive diagno- 
sis and/or improved, sustained and/or more effective treatment of the liver disor- 
ders, and/or other epithelial cancers. 

In a further preferred embodiment the invention relates to the a nucleic acid (SEQ 
ID 17) coding for the SSP29.pr or APRIL polypeptide, which have been described 
in many tissues and tumors. The gene encoding this putative tumor necrosis family 
member has not previously been reported to be expressed at elevated levels in di- 
sorders according to the invention, in particular in HCC. Furthermore the invention 



-31 - 

relates to the silver stainable 29 kDa polypeptide (SSP29.pr; SEQ ID 8) which is 
encoded by the nucleic acid (SEQ ID 17) according to the invention. The polypep- 
tide has been identified as a leucine rich secreted polypeptide, likely belonging the 
TNF cytokine family* It is also known as APRIL (acidic polypeptide rich in leuci- 
nes) and contains leucine rich repeats (LRRs) near the N-terminus that may be in- 
volved in antigen-mediated cellular responses. (Zhu et al., 1997, Biochem. MoL 
Biol. Int. 42:927-935; Mencinger et al., 1998, Biochim. Biophys. Acta 1395: 176- 
180). Expression of the SSP29.pr polypeptide has not been reported in human cells 
or tissues. The mRNA encoding this polypeptide is elevated an average of 3.77- 
fold relative to non-diseased liver in 17 of 21 HCCs profiled. Surprisingly, the 
level of the mRNA encoding this polypeptide is 30-fold higher in cirrhosis caused 
by copper toxicity than in a pool of non-diseased liver. mRNA levels are margi- 
nally elevated in other liver disorders profiled relative to non-diseased liver and 
this mRNA is otherwise detected only infrequently in the normal and diseased tis- 
sues subjected here to expression profiling. These results show that the strongly 
upregulated expression of the SSP29 cDNA sequence is highly specific for disor- 
ders according to the invention, especially in HCC, and certain types of cirrhosis 
disease. Therefore the SSP29.pr polypeptide and/or the encoding nucleic acid can 
be utilized for the diagnosis, prevention and treatment of disorders according to the 
invention, in particular for the diagnosis of HCC and cirrhosis. With regard to the 
treatment it is preferred to carry out the treatment such that the expression of the 
SSP29.pr polypeptide or of the nucleic acid encoding the polypeptide is reduced 
and/or inhibited, for example by administering antisense oligonucleotides or RNA 
interference molecules that specifically interact with the nucleic acid encoding the 
SSP29.pr polypeptide. Alternatively the treatment may be carried out such that the 
activity of the SSP29.pr polypeptide is reduced and/or inhibited, for example by 
administering an antibody directed against the SSP29-pr polypeptide or an antibo- 
dy fragment thereof which block the activity of the SSP29.pr polypeptide to a pati- 
ent in need of such treatment. Compared to the state of the art, this SSP29.pr poly- 
peptide and/or SSP29 nucleic acid surprisingly allow improved, more sensitive, 
earlier, faster, and/or non-invasive diagnosis and/or improved, sustained and/or 
more effective treatment of the liver disorders, and/or other epithelial cancers. 
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In another preferred embodiment the invention relates to the HS16 nucleic acid 
(SEQ ID 18). cDNA clones corresponding to this message have been identified in 
several tissues including adenocarcinoma* of the colon but neither this mRNA nor 
the encoded polypeptide (HS16.pr, SEQ ID 9) have been previously implicated in 
disorders according to the invention, in particular in liver disorders or in HCC. The 
invention further relates to the polypeptide encoding for the HS16.pr is a predicted 
polypeptide of 16.7 kDa (SEQ ID 9; Accession number NP_057223 in the Gen- 
Bank database). The presence of the polypeptide has not been described in any cell 
or tissue and its function has not been described nor are functional domains identi- 
fied with the CDD algorithm. mRNA encoding this polypeptide is elevated at least 
2:8-fold or higher in 8 of the HCCs examined and by nearly 2-fold in aid additional 
4 HCC samples examined, all relative to non-diseased liver. These results show 
that the strongly upregulated expression of the HS16 cDNA sequence is highly 
specific for disorders according to the invention, especially in HCC. Therefore the 
HS16.pr polypeptide and/or the encoding nucleic acid can be utilized for the dia- 
gnosis, prevention and treatment of disorders according to the invention, in parti- 
cular for the diagnosis of HCC . With regard to the treatment it is preferred to carry 
out the treatment such that the expression of the HS16.pr polypeptide or of the 
nucleic acid encoding the polypeptide is reduced and/or inhibited, for example by 
administering antisense oligonucleotides or RNA interference molecules that spe- 
cifically interact with the nucleic acid encoding the HS16.pr polypeptide. Alterna- 
tively the treatment may be carried out such that the activity of the HS16.pr poly- 
peptide is reduced and/of inhibited, for example by administering an antibody di- 
rected against the HS16.pr polypeptide or an antibody fragment thereof which 
block the activity of the HS16.pr polypeptide to a patient in need of such treatment. 
Compared to the state of the art, this HS16.pr polypeptide and/or HS16 nucleic 
acid surprisingly allow improved, more sensitive, earlier, faster, and/or non- 
invasive diagnosis and/or improved, sustained and/or more effective treatment of 
the liver disordersand/or other epithelial cancers. 

In another preferred embodiment the invention relates to the IK3 nucleic acid 
(SEQ ID 19) which is a highly deregulated mRNA associated with HCC but wi- 
thout a clear open reading frame to encode a corresponding polypeptide, Surpri- 
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singly the sequence from this mRNA is represented at much higher levels in HCC 
than in normal human liver. The over-expressed sequence identified here in HCC 
corresponds to a fetal brain cDNA in the GenBank database (AL049338). Otherwi- 
se there is no information regarding expression of this gene in normal or diseased 
tissues. This sequence may be extended to identify a corresponding polypeptide 
encoding open reading frame. Alternatively, the provided database sequence may 
include an error(s) that account for the lack of an open reading frame. Yet another 
alternative is that the encoded polypeptide may result from one of the small open 
reading frames in this sequence. Even further, this KNA may be not translated into 
polypeptide but may have functional (e.g., regulatory) properties itself. This 
mRNA is elevated an average of 3.81-fold or more relative to non-diseased liver in 
12 of the 21 HCC samples profiled (57%). HC3 is also elevated 2-fold or more re- 
lative to non-diseased liver in 3 of 4 FNH examined, in adenoma and in 5 of the 6 
cirrhosis samples examined. These results show that the strongly upregulated ex- 
pression of the IK3 cDNA sequence is highly specific for disorders according to 
the invention, especially in HCC, FNH, adenoma and cirrhosis. Therefore the IK3 
polypeptide and/or the encoding nucleic acid can be utilized for the diagnosis, pre- 
vention and treatment of disorders according to the invention, in particular for the 
diagnosis of in HCC, FNH, adenoma and cirrhosis. With regard to the treatment it. 
is preferred to carry out the treatment such that the expression of the polypeptide 
encoded by the IK3 or of the IK3 nucleic acid is reduced and/or inhibited, for ex- 
ample by administering antisense oligonucleotides or RNA interference molecules 
that specifically interact with the IK3 nucleic acid. Alternatively the treatment may 
be carried out such that the activity of the IK3 polypeptide is reduced and/or inhi- 
bited, for example by administering an antibody directed against the IK3 polypep- 
tide or an antibody fragment thereof which block the activity of the IK3 polypepti- 
de to a patient in need of such treatment. Compared to the state of the art, this IK3 
nucleic acid surprisingly allows improved, more sensitive, earlier, faster, and/or 
non-invasive diagnosis and/or improved, sustained and/or more effective treatment 
of the liver disorders and/or other epithelial cancers. 



The Accession numbers of the polypeptides according to the invention and their 
cDNAs are shown in Figure 1 . 
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In another preferred embodiment of the invention the nucleic acid according to the 
invention can be used for the construction of antisense oligonucleotides (Zheng and 
Kemeny, 1995, Clin. Exp. Immunol 100: 380-2; Nellen and Lichtenstein, 1993, 
Trends Biochem. Sci. 18: 419-23; Stein, 1992, Leukemia 6: 967-74) and/or ribo- 
zymes (Amarzguioui, et al. 1998, Cell. Mol. Life Sci. 54: 1175-202; Vaish et al., 
1998, Nucleic Acids Res. 26: 5237-42; Persidis, 1997, Nat Biotechnol. 15: 921-2; 
Couture and Stinchcomb, 1996, Trends Genet. 12: 510-5) and/or small interfering 
double stranded RNAs (Elbashir et al., 2001, Nature 411: 494-98; Brummelkamp 
et al., 2002, Science 296:550-553). In further preferred embodiments of the inven- 
tion, the stability of the nucleic acid according to the invention can be decreased 
and/or the translation of the nucleic acid according to the invention inhibited by 
using RNA interference molecules (oligonucleotides). Thus, for example, the ex- 
pression of the corresponding genes in cells can be decreased both in vivo and in 
vitro . Oligonucleotides can therefore be suitable as therapeutics. This strategy is 
also suitable, for example, for liver cells, in particular if the antisense oligonucleo- 
tides are complexed with liposomes. For use as a probe or as an " antisense" oligo- 
nucleotide, a single-stranded DN A or RNA is preferred. 

In a preferred embodiment a nucleic acid according to the invention has been pre- 
pared by recombinant methods, by screening a library or isolation from a sample 
obtained from a patient or a subject. In another preferred embodiment of the inven- 
tion the nucleic acid according to the invention has been prepared synthetically. 
Thus, the nucleic acid according to the invention can be synthesized, for example, 
chemically with the aid of the DNA sequences described in SEQ ID No. 10 to SEQ 
ID No. 19 and/or with the aid of the protein sequences described in SEQ ID No.l 
to SEQ ID No. 9 with reference to the genetic code, e.g. according to the phos- 
photriester method (see, for example, Uhlmann and Peyman, 1990, Chemical Re- 
views 90:543-584). 

In another preferred embodiment, the invention relates to a nucleic acid according 
to the invention or a nucleic acid which is a non-functional mutant variant the 
nucleic acid or a nucleic acid having a sequence complementary to one of the afo- 
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rementioned nucleic acids, which has been modified by attachment of chemical 
moieties to the nucleic acid to stabilize it against degradation, so that a high con- 
centration of the nucleic acid is maintained in lhe cell over a long period (Beigel- 
man et aL, 1995, Nucleic Acids Res, 23: 3989-94; Dudycz, 1995, WO 95/11910; 
Macadam et aL, 1998, WO 98/37240; Reese et aL, 1997, WO 97/29116). Typical- 
ly, such a stabilization can be obtained by the introduction of one or more inter- 
nucleotide phosphorus groups or by the introduction of one or more non- 
phosphorus internucleotides. 

Preferred suitable modified internucleotides are summarized in Uhlmann and Pey- 
mann (1990 Chem. Rev. 90, 544; see also Beigelman et aL, 1995 Nucleic Acids 
Res. 23: 3989-94; Dudycz, 1995, WO 95/11910; Macadam et aL, 1998, 
WO 98/37240; Reese et aL, 1997, WO 97/291 16). 

In a further embodiment the invention relates to a vector containing a nucleic acid 
according to the invention or a nucleic acid which is a non-functional mutant vari- 
ant the nucleic acid or a nucleic acid having a sequence complementary to one the 
aforementioned nucleic acids. Preferably the vector is a knock-out gene construct, 
a plasmid, a shuttle vector, a phagemid, a cosmid, a viral vector, an expression 
vector and/or a vector applicable in gene therapy. 

The preparation of such constructs is generally known to the person skilled in the 
art. 

An "expression vector" within the meaning of the present invention, preferably 
comprises at least one promoter or enhancer, i.e. at least one regulatory element 
comprising at least one translation initiation signal, at least one of the nucleic acids 
according to the invention or a nucleic acid which is a non-functional mutant vari- 
ant the nucleic acid or a nucleic acid having a sequence complementary to one of 
the aforementioned nucleic acids, one translation termination signal, a transcription 
termination signal, and a polyadenylation signal for the expression in eukaryotes. 
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For the expression of the gene concerned, in general a double-stranded DNA is 
preferred, the DNA region coding for the polypeptide being particularly preferred. 
In the case of eukaryotes this region begins with the first start codon (ATG) lying 
in a Kozak sequence (Kozak, 1987, Nucleic. Acids Res. 15: 8125-48) up to the 
next stop codon (TAG, TGA or TAA), which lies in the same reading frame to the 
ATG. In the case of prokaryotes this region begins with the first AUG (or GUG) 
after a Shine-Dalgarno sequence and ends with the next stop codon (TAA TAG or 
TGA), which lies in the same reading frame to the ATG. 

Differentially expressed genes in HCC can contain cancer or liver or liver cancer 
gene-specific regulatory sequences. These non-transcribed sequences, found in the 
tissue- or disease-specific gene may be used to drive tissue- or disease-specific 
expression of included therapeutic and/or tumor cell-cytotoxic genes. These regu- 
latory sequences may be used for liver cancer specific expression of a nucleic acid 
according to the invention or a nucleic acid which is a non-functional mutant vari- 
ant the nucleic acid or a nucleic acid having a sequence complementary to one of 
the aforementioned nucleic acids. The screening and construction of such regula- 
tory a sequences is generally known to the person skilled in the art. 

Suitable expression vectors can be prokaryotic or eukaryotic expression vectors. 
Examples of prokaryotic expression vectors are, for expression in E coli, e.g. the 
vectors pGEM or pUC derivatives, examples of eukaryotic expression vectors are 
for expression in Saccbaromyces cerevisiae, e.g. the vectors p426Met25 or 
p426GALl (Mumberg et al. (1994) Nucl. Acids Res., 22, 5767-5768), for expres- 
sion in insect cells, e.g. Baculovirus vectors such as disclosed in EP-B1-0 127 839 
or EP-B1-0 549 721, and for expression in mammalian cells, e.g. the vectors 
Rc/CMV and Rc/RSV or SV40 vectors, which are all generally obtainable. Speci- 
fic vectors for production of RNA interference following transfection, such as the 
pSUPER vector (Brummelkamp et al., 2002, Science 296:550-553) are also inclu- 
ded. 
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In general, the expression vectors also contain promoters suitable for the respective 
cell, such as, for example, the tip promoter for expression in E. coli (see, for exam- 
ple, EP-B1-0 154 133), the MET 25, GAL 1 or ADH2 promoter for expression in 
yeast (Russel et aL (1983), J. Biol. Chem. 258, 2674-2682; Mumberg, supra), the 
Baculovirus polyhedrin promoter, for expression in insect cells (see, for example, 
EP-B1-0 127 839), For expression in mammalian cells, for example, suitable pro- 
moters are those which allow a constitutive, regulatable, tissue-specific, cell-cycle- 
specific or metabolically specific expression in eukaryotic cells. Regulatory ele- 
ments according to the present invention preferably are promoters, activator se- 
quences, enhancers, silencers and/or repressor sequences. 

Examples of suitable regulatory elements which make possible constitutive expres- 
sion in eukaryotes preferably are promoters which are recognized by the RNA po- 
lymerase m or viral promoters, CMV enhancer, CMV promoter, SV40 promoter or 
LTR promoters, e.g. from MMTV (mouse mammary tumor virus; Lee et aL (1981) 
Nature 214, 228-232) and further viral promoter and activator sequences, derived 
from, for example, adeno- and adeno-like viruses, HBV, HCV, HSV, HPV, EBV, 
HTLVorHTV. 

Examples of regulatory elements which make possible regulated expression in 
eukaryotes are the tetracycline operator in combination with a corresponding re- 
pressor (Gossen et aL, 1 994, Curr. Opin. BiotechnoL 5:51 6-20). 

Translation initiation signals, translation termination signals, transcription termina- 
tion signals, and polyadenylation signals are generally known to the person skilled 
in the art and can be readily obtained from commercial laboratory suppliers. 

Preferably, the expression of the genes relevant for liver disorders and/or epithelial 
cancer takes place under the control of tissue-specific promoters, for example, un- 
der the control of liver-specific promoters such as albumin, alpha fetoprotein, apo- 
lipoprotein AI, alpha- 1 antitrypsin, and the complement C5 and C8A genes 
(Schrem et aL, 2002, Pharmacol. Rev. 54 129-58; Pontoglio et aL, 2001, J. Expt 
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Med. 194:1683-1689). The regulatory sequences associated with genes highly de- . 
regulated in HCC as described herein also provide a preferable method for specific 
gene expression in these disorders. - 

Further examples of regulatory elements which make tissue-specific expression in 
eukaryotes possible are promoters or activator Sequences from promoters or enhan- 
cers of those genes which code for proteins which are only expressed in certain cell 
types. 

Examples of regulatory elements which make possible metabolically specific ex- 
pression in eukaryotes are promoters which are regulated by hypoxia, by oxidative 
stress, by glucose deficiency, by phosphate concentration or by heat shock. 

Examples of regulatory elements which make cell cycle-specific expression in eu- 
karyotes possible are promoters of the following genes: cdc25A, cdc25B, cdc25C, 
cyclin A,.cyclin E, cdc2, E2F-1 to E2F-5, B-myb or DHFR (Zwicker J. and Mtiller 
R.,1997, Trends Genet. 13: 3-6). The use of cell cycle regulated promoters is parti- 
cularly preferred in cases, in which expression of the polypeptides or nucleic acids 
according to the invention is to be restricted to proliferating cells. 

In order to make possible the introduction of nucleic acids as described above or a 
nucleic acid which is a non-functional mutant variant the nucleic acid and thus the 
expression of the polypeptide in a eu- or prokaryotic cell by transfection, transfor- 
mation or infection, the nucleic acid can be present as a plasmid, as part of a viral 
or non-viral vector. Suitable viral vectors here are particularly: baculoviruses, vac- 
cinia viruses, adenoviruses, adeno-associated viruses, retroviruses and herpesvi- 
ruses. Suitable non-viral vectors here are particularly: virosomes, liposomes, cati- 
onic lipids, or polylysine-conjugated DNA or naked DNA. 
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Plasmids, shuttle vectors, phagemids, and cosmids suitable for use according to the 
invention are also known to the person skilled in the art and are generally obtain- 
able from commercial laboratory suppliers. 

Examples of vectors applicable in gene therapy are virus vectors, for example ade- 
novirus vectors, retroviral vectors or vectors based on replicons of RNA viruses 
(Lindemann et al., 1997, Mol. Med. 3: 466-76; Springer et al., 1998, MoL Cell. 2: 
549-58, Khromykh, 2000, Curr. Opin. Mol Ther. 2:555-569). Eukaryotic expressi- 
on vectors are suitable in isolated form for gene therapy use, as naked DNA can 
penetrate, for example, into liver cells upon local application or via the blood 
supply* 

Compared to the state of the art, this fusion construct surprisingly allows improved, 
more sensitive, earlier, faster, and/or non-invasive diagnosis and/or improved, su- 
stained and/or more effective treatment of the liver disorders, and/or other epitheli- 
al cancers. 

In another aspect the invention furthermore relates to a cell containing a nucleic 
acid according to the invention. Preferably the cell is transformed with a vector 
according to the invention. The cell preferably contains a nucleic acid wherein the 
nucleic acid is either a non-functional mutant variant of a nucleic acid according to 
the invention or a variant thereof. In particular the cell contains vector containing a 
nucleic acid wherein the nucleic acid is a non-functional mutant variant of a 
nucleic acid according to the invention or a variant thereof. Perferably the cell 
contains a nucleic acid coding for a nucleic acid having a sequence complementary 
to a nucleic acid according to the invention or a variant thereof. Moreover the cell 
preferably contains a vector containing a nucleic acid coding for an antibody ac- 
cording to the invention or a fragment of the antibody. The cell according to the 
invention, may for example be a liver cell, containing at least one of the aforemen- 
tioned nucleic acids or a cell which is transformed using one of the above described 
vectors. Cells can be either prokaryotic or eukaryotic cells, heterologous or auto- 
logous cells. Examples of prokaryotic cells are R coli and examples of eukaryotic 



-40- 

cells include primary hepatocytes cells, hepatocytes cell lines such as HepG2 cells, 
yeast cells, for example Saccharomyces cerevisiae or insect cells. 

Compared to the state of the art, the cell according to the invention surprisingly 
allows improved, more sensitive, earlier, faster, and/or non-invasive diagnosis 
and/or improved, sustained and/or more effective treatment of the liver disorders 
and/or other epithelial cancers. 

In a preferred embodiment of the invention the cell is a transgenic embryonic 
non-human stem cell which comprises at least one nucleic acid according to the 
invention, at least one vector, at least one knock-out gene construct and/or at least 
one expression vector as described above. 

Processes for the transformation of cells and/or stem cells are well known to a per- 
son skilled in the art and include, for example, electroporation or microinjection. 

In another aspect the invention relates to the provision of a transgenic non-human 
mammal containing a compound selected from the group consisting of a nucleic 
acid according to the invention, a nucleic acid which is a non-functional mutant 
variant the nucleic acid, a nucleic acid having a sequence complementary to one of 
the aforementioned nucleic acids, one of the aformementioned nucleic acids in the 
form of a vector, of a knock-down or knock-out gene construct, and of an expressi- 
on vector. 

Transgenic animals in general show a tissue-specifically increased expression of 
the nucleic acids and/or polypeptides and can be used for the analysis of liver di- 
sorders and/or epithelial cancers, such as for example HCC, and for development 
and evaluation of therapeutic strategies for such disorders. Transgenic animals may 
further be employed in the production of polypeptides according to the invention. 
The polypeptide produced by the animal may for example be enriched in a body 
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fluid of the animal. The polypeptides according to the invention may for example 
be isolatable from a body fluid such as the milk. 

Compared to the state of the art, this transgenic non-human mammal surprisingly 
allows improved, more sensitive, earlier, faster, and/or non-invasive analysis 
and/or diagnosis of a liver disorders and/or other epithelial cancers. 

Processes for the preparation of transgenic animals, in particular of transgenic 
mice, are likewise known to the person skilled in the art from DE 196 25 049 and 
US 4,736,866; US 5,625,122; US 5,698,765; US 5,583,278 and US 5,750,825 and 
include transgenic animals which can be produced, for example, by means of direct 
injection of expression vectors according to the invention into embryos or sperma- 
tocytes or by injection of the expression vectors into the pronucleus of the fertilized 
ovum or by means of the transfection of expression vectors into embryonic stem 
cells or by nuclear transfer into appropriate recipient cells (Polites and Pinkert, 
DNA Microinjection and Transgenic Animal Production, page 15 to 68 in Pinkert, 
1994, Transgenic animal technology: a laboratory handbook, Academic Press, 
London, UK; Houdebine, 1997, Harwood Academic Publishers, Amsterdam, The 
Netherlands; Doetschman, Gene Transfer in Embryonic Stem Cells, page 115 to 
146 in Pinkert, 1994, supra; Wood, Retrovirus-Mediated Gene Transfer, page 147 
to 176 in Pinkert, 1994, supra; Monastersky, Gene Transfer Technology; Alternati- 
ve Techniques and Applications, page 177 to 220 in Pinkert, 1994, supra). 

If the above described nucleic acids are integrated into so-called "targeting vectors" 
or "knock-out" gene constructs (Pinkert, 1994, supra), it is possible after transfec- 
tion of embryonic stem cells and homologous recombination, for example, to gen- 
erate knock-out mice which, in general, as heterozygous mice, show decreased 
expression of the nucleic acid, while homozygous mice no longer exhibit expres- 
sion of the nucleic acid. The animals thus produced can also be used for the analy- 
sis of liver disorders, such as for example HCC, and/or epithelial cancers. 
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Knock-out gene constructs are known to the person skilled in the art, for example, 
from the US patents 5,625,122; US 5,698,765; US 5,583,278 and US 5,750,825. 

In a further aspect the invention relates to an antibody or a fragment thereof is pro- 
vided, wherein the antibody or antibody fragment is directed against a polypeptide 
according to the invention, a funtional variant thereof or against a nucleic acid co- 
ding for the polypeptide or a variant thereof. 

Compared to the state of the art, these antibody or a fragment thereof surprisingly 
allow improved, more sensitive, earlier, faster, and/or non-invasive diagnosis 
and/or improved, sustained and/or more effective treatment of the liver disorders 
and/or other epithelial cancers. 

The term "antibody" or "antibody fragment" is understood according to the present 
invention as also meaning antibodies or antigen-binding parts thereof prepared by 
genetic engineering and optionally modified, such as, for example, chimeric anti- 
bodies, humanized antibodies, multifunctional antibodies, bi- or oligospecific anti- 
bodies, single-stranded antibodies, F(ab) or F(ab>2 fragments (see, for example, EP- 
Bl-0 368 684, US 4,816,567, US 4,816,397, WO 88/01649, WO 93/06213, 
WO 98/24884). The antibodies according to the invention can for example be used 
for diagnosis, prevention and/or treatment of disorders according to the invention 
such as liver disorders, for example HCC, and/or epithelial cancers. 

The invention further relates to a method for producing an antibody or antibody 
fragment, preferably a polyclonal or monoclonal antibody, specific for the poly- 
peptides and the polypeptides encoded by the nucleic acids according to the inven- 
tion for example for the diagnosis and/or prevention and/or treatment of disorders 
according to the invention. The process is carried out according to methods gene- 
rally known to the person skilled in the art by immunizing a mammal, for example 
a rabbit, with a nucleic acid according to the invention, or with a polypeptide ac- 
cording to the invention or parts thereof having at least 6 amino acid length, pre- 
ferably having at least 8 amino acid length, in particular having at least 12 amino 
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acid length, if appropriate in the presence o£ for example, Freund's adjuvant 
and/or aluminum hydroxide gels (see, for example, Harlow and Lane, 1998, Using 
Antibodies: A Laboratory Manual, Cold Spring Harbor Press, New York, USA, 
Chapter 5, pp. 53-135). The polyclonal antibodies formed in the animal as a result 
of an immunological reaction can then be easily isolated from the blood according 
to generally known methods and purified, for example, by means of column chro- 
matography. Monoclonal antibodies can be produced, for example, according to 
the known method of Winter & Milstein (Winter and Milstein, 1991, Nature 
349:293-299). 

The present invention further relates to an antibody or antibody fragments directed 
against a polypeptide described above and reacts specifically with the polypeptides 
described above, where the above-mentioned parts of the polypeptide are either 
immunogenic themselves or can be rendered immunogenic by coupling to suitable 
carriers, such as, for example, bovine serum albumin or keyhole limpet hemocya- 
nin to increase in their immunogenicity. This antibody is either polyclonal or mo- 
noclonal, preferably it is a monoclonal antibody. 

Still further, the present invention relates to the generation and/or production of an 
antibody or antibody fragment specific for the polypeptide according to the inven- 
tion from a recombinant antibody expression library, such as for example described 
by Knappik et al. (2000, J. Molec. Biol. 296:57-86) or by Chadd and Chamow 
(2001 Curr. Opin. BiotechnoL 12:188-94). 

In another embodiment of the invention, it is provided an array, wherein the array 
contains at least two compounds selected from the group consisting of a polypepti- 
de according to the invention, a functional variant thereof, a nucleic acid encoding 
the polypeptide, a nucleic acid according to SEQ ID No. 19, a non-functional 
mutant variant the nucleic acid and an antibody or an antibody fragment directed 
against the polypeptide. Alternatively, the array may contain at least 1 component 
according to the invention in combination with previously described components 
implicated in neoplastic or metabolic liver disorders or epithelial cancers. 
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Within the meaning of the invention the term "array" refers to a solid-phase or gel- 
like carrier upon which at least two compounds are attached or bound in one-, two- 
or three-dimensional arrangement Such arrays are generally known to the person 
skilled in the art and are typically generated on glass microscope slides, specially 
coated glass slides such as polycation-, nitrocellulose- or biotin- coated slides, co- 
ver slips, and membranes such as for example membranes based on nitrocellulose 
or nylon. 

The aforementioned arrays include bound polypeptides according to the invention 
or functional variants thereof or nucleic acids coding for the polypeptides or vari- 
ants thereof fusion proteins according to the invention or antibodies or antibody 
fragments directed against polypeptides according to the invention or functional 
variants thereof or cells expressing polypeptides according to the invention or 
functional variants thereof or at least two cells expressing at least one nucleic acid 
according to the invention or variants thereof. Nucleic acids coding for these or 
variants thereof can also be part of an array. Such arrays can be employed for ana- 
lysis and/or diagnosis of liver disorders, preferably of HCC; and/or epithelial can- 
cer. 

The invention further relates to a method of producing arrays according to the in- 
vention, wherein at least two compounds according to the invention are bound to a 
carrier material. 

Methods of producing such arrays, for example based on solid-phase chemistry and 
photo-labile protective groups are generally known (US 5,744,305). Such arrays 
can also brought into contact with substances or a substance libraries and tested for 
interaction, for example for binding or change of conformation. 

The invention further relates to a process for preparing an array immobilized on a 
support material for analysis and/or diagnosis of disorders according to the inventi- 
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on such as a liver disorder, preferably of HCC, in which at least two nucleic acid, 
at least two polypeptide or at least two antibody or antibody fragment, and/or at 
least two cell, or at least one of the aforementioned components in combination 
with other components relevant to neoplastic and metabolic liver disorders or epit- 
helial cancers, as described above, is used for preparation. The arrays produced by 
such process can be employed for the diagnosis of disorders according to the in- 
vention. 

In another aspect the invention relates to a diagnostic contains at least one com- 
pound selected from the group consisting of a polypeptide according to the inventi- 
on or a functional variant thereof, a nucleic acid encoding the polypeptide, pre- 
ferably a nucleic acid according to SEQ ID No. 10 - 18 or or a nucleic acid coding 
for the SEQ ID No. 19 or a variant of one of the aformementioned nucleic acids, 
and an antibody or an antibody fragment according to the invention, combined or 
together with suitable additives or auxiliaries. 

Compared to the state of the art, this diagnostic surprisingly allow improved, more 
sensitive, earlier, faster, and/or non-invasive diagnosis of liver disordersand/or 
other epithelial cancers. 

Within the meaning of the invention "suitable additives" or "auxiliaries" are gene- 
rally known to the person skilled in the art and comprise, for example, physiologi- 
cal saline solution, demineralized water, gelatin or glycerol-based protein stabili- 
zing reagents. Alternatively, the nucleic acid or polypeptide according to the in- 
vention may be lyophilized for stabilization. 

In another example a diagnostic kit based on the nucleic acid sequences according 
to the invention could be generated. Such a kit may be designed specifically to 
detect cells altered as a result of the described disorders resident in the circulatory 
system and thereby detectable in serum from test patients. Additional examples of 
diagnostic kits includes enzyme linked immunosorbent assays (ELISA), radioim- 
munoassays (RIA), and detection of an immune reaction or specific antibodies to 
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the polypeptides according to the invention including detection of specific respon- 
ding immune cells. 

In a preferred embodiment the diagnostic according to the invention contains a 
probe, preferentially a DNA probe. 

For example, it is possible according to the present invention to prepare a diagno- 
stic based on the polymerase chain reaction (PCR). Under defined conditions, pre- 
ferably using primers specific for a nucleic acid according to the invention as a 
DNA probe PCRs specific for the nucleic acid sequences of the invention will be 
utilized to monitor both the presence, and especially the amount, of specific nucleic 
acids according to the invention in a sample isolated from a patient obtained for 
diagnostic or therapeutic purposes. This opens up a further possibility of obtaining 
the described nucleic acids, for example by isolation from a suitable gene or cDNA 
library, for example from a liver disorder-specific or liver specific gene bank, with 
the aid of a suitable probe (see, for example, J. Sambrook et al., 1989, Molecular 
Cloning. A Laboratory Manual 2nd edn., Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY Chapter 8 pages 8: 1 to 8.81, Chapter 9 pages 9.47 to 9.58 and 
Chapter 10 pages 10.1 to 10.67). 

Suitable probes are, for example, DNA or RNA fragments having a length of about 
50-1000 nucleotides, preferably having a length of about 10 to about 100 nucleoti- 
des, preferably about 100 to about 200 nucleotides, in particular having a length of 
about 200-500 nucleotides, whose sequence can be derived from the polypeptides 
according to SEQ ID No. 1 to SEQ ID No. 9 and functional variants thereof and 
nucleic acids coding for the polypeptides, preferably according to SEQ ID No. 10 
to SEQ ID No. 19 and variants thereof. 

Alternatively, it is preferably possible with the aid of the derived nucleic acid se- 
quences to synthesize oligonucleotides which are suitable as primers for a polyme- 
rase chain reaction. Using this, the nucleic acid described above or parts of this can 
be amplified and isolated from cDNA, for example HCC-specific cDNA. Suitable 
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primers are, for example, DNA fragments having a length of about 10 to 100 
nucleotides, preferably having a length of about 15 to 50 nucleotides, in particular 
having a length of 17 to 30 nucleotides, whose sequence can be derived from the 
polypeptides according to SEQ ID No. 1 to SEQ ID No, 9 or from the nucleic acids 
according to SEQ ID No- 10 to SEQ ID No. 19. The design and synthesis of such 
primers is generally known to the person skilled in the art. The primers may addi- 
tionally contain restriction sites, e.g. suitable for integration of the amplified se- 
quence into vectors, or other adapters or overhang sequences, e.g. having a marker 
molecule such as a fluorescent marker attached, generally known to the skilled 
worker. 

In another aspect of the invention it is provided a method of diagnosis of a disorder 
according to the invention, wherein at least one compound selected from the group 
consisting of a polypeptide according to the sequence of SEQ ID No. 1 to SEQ ID 
No. 9, a functional variant thereof a nucleic acid encoding the polypeptide, a 
nucleic acid according to SEQ ID No. 19, a variant of one of the aforementioned 
nucleic acids, and an antibody directed against the polypeptide or antibody frag- 
ment thereof, is identified in the sample of a patient and compared with at least one 
compound of a reference library or of a reference sample. 

In a preferred embodiment of the method the disorder of the liver is a disorder se- 
lected from the group consisting of cirrhosis, alcoholic liver disease, chronic hepa- 
titis, Wilson's Disease, heamochromatosis, hepatocellular carcinoma, benign liver 
neoplasms, and focal nodular hyperplasia. 

In a preferred embodiment of the method the epithelial cancer is an adenocarcino- 
ma of any organ other than liver, preferably of an organ selected from the group 
consisting of the lung, the stomach, the kidney, the colon, the prostate, the skin, 
and the breast 
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Compared to the state of the art, this diagnostic surprisingly allows improved, more 
sensitive, earlier, faster, and/or non-invasive diagnosis of the liver disorders and/or 
other epithelial cancers. 

Preferably the sample is isolated from a patient by non-invasive methods as descri- 
bed above. 

For example, serum detection of specific deregulated gene proteins via ELISA 
assay is one application, alternatively one or a panel of antibodies to deregulated 
gene products from which a diagnostic score is deduced based on the combinations 
of, and expression levels of gene products expressed in the diseased tissue or in 
serum from diseased individuals. 

A preferred diagnostic according to the invention contains the described polypepti- 
de or the immunogenic parts thereof described in greater detail above. The poly- 
peptide or the parts thereof, which are preferably bound to a solid phase, e.g. of 
nitrocellulose or nylon, can be brought into contact in vitro, for example, with the 
body fluid to be investigated, e.g. blood, serum, plasma, ascitic fluid, pleural effu- 
sion, cerebral spinal fluid, saliva, urine, semen, in order thus to be able to react, for 
example, with autoimmune antibodies present in e.g. the blood of the patient. The 
antibody-peptide complex can then be detected, for example, with the aid of labe- 
led antihuman IgG antibodies. The labeling involves, for example, an enzyme, such 
as peroxidase, which catalyses a color or chemiluminescent reaction. The presence 
and the amount of autoimmune antibody present can thus be detected easily and 
rapidly by means of the color. 

In addition the diagnostic may be directed to detecting an endogenous antibody or 
fragment thereof present in the sample isolated from a patient which antibody or 
fragment thereof is directed against a polypeptide according to the invention. De- 
tection of such autoimmune antibodies may be accomplished by methods generally 
known to the skilled artisan, e.g. by immunoaffinity assays using polypeptides ac- 
cording to the invention or functional variants thereof or parts thereof as a probe. 



-49- 

Preferably the presence of such autoimmune antibodies is indicative of the patient 
suffering from a disorder according to the invention. 

A further diagnostic, that is subject matter of the present invention, contains the 
antibodies according to the invention themselves. With the aid of these antibodies, 
it is possible, for example, to easily and rapidly investigate a tissue sample as to 
whether the concerned polypeptide according to the invention is present in an in- 
creased amount in order to thereby obtain an indication of possible disease inclu- 
ding liver disorders, for example HCC. In this case, the antibodies according to the 
invention are preferably labeled directly, or more commonly for example these are 
detected with a specific secondary antibody indirectly, such as with an enzyme or 
fluorescent molecule, as already described above. The specific antibody-peptide 
complex can thereby be detected easily and rapidly, e.g., by means of an enzymatic 
color reaction. 

In still another aspect of the invention it is provided a method for identifying at 
least one nucleic acid according to SEQ ID No. 10 to SEQ ID No. 19 or a variant 
thereof differentially expressed in a sample isolated from a patient relative to a 
reference library or a reference sample comprising the following steps: 

1. Detecting the expression of at least one nucleic acid according to SEQ ID 
No. 10 to SEQ ID No. 19 or a variant thereof in a sample isolated from a 
patient, 

2. Comparing the expression of at least one nucleic acid detected in step 1 with 
the expression of the same at least one nucleic acid in a reference library or 
in a reference sample, 

3. Identifying this at least one nucleic acid which is differentially expressed in 
the sample isolated from the patient compared to the reference library or the 
reference sample. 

Compared to the state of the art the method surprisingly allows improved, more 
sensitive, earlier, faster, and/or non-invasive identification of differentially ex- 
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pressed nucleic acids according to the invention which provides a useful basis for 
diagnosing a disorder according to the invention. 

Preferably at least 2, at least 3, at least 4 at least 5, at least 6, or at least 7 nucleic 
acids are identified. 

In another preferred embodiment of the method the at least one nucleic acid is de- 
tected by PCR based detection or by a hybridization assay. 

In another preferred embodiment of the method the expression of the at least one 
nucleic acid is compared by a method selected from the group consisting of so- 
lid-phase based screening methods, hybridization, subtractive hybridization, diffe- 
rential display, and RNase protection assay. 

In a further preferred embodiment of the method the sample isolated from the pati- 
ent is selected from the group consisting of liver tissue, a liver cell, tissue from 
another organ subject to cancerous transformation, a cell from this organ, blood, 
serum, plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, 
semen, and feces. 

Preferably the reference sample is isolated from a source selected from a 
non-diseased sample of the same patient or a non-diseased sample from another 
subject. The selection of appropriate reference samples is generally known to the 
person skilled in the art In particular the reference sample may be selected from the 
group consisting of liver tissue, a liver cell, blood, serum, plasma, ascitic fluid, 
pleural effusion, cerebral spinal fluid, saliva, urine, semen, and feces. 

In another preferred embodiment of the method, the reference library is an expres- 
sion library or a data base containing clones or data on non-diseased expression of 
at least one nucleic acid according to the invention in samples that preferably may 
be selected from the group consisting of liver tissue, a liver cell, blood, serum, 
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plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, semen, 
and feces. 

In another aspect of the invention it is provided a method of diagnosing a liver di- 
sorder, and/or another epithelial cancer comprising the following steps: 

1. Detecting the expression of at least one nucleic acid according to SEQ ID 
No. 10 to SEQ ID No, 19 or a variant thereof in a sample isolated from a 
patient, 

2. Comparing the expression of at least one nucleic acid detected in step 1 with 
the expression of the same at least one nucleic acid in a reference library or 
in a reference sample, 

3. Identifying this at least one nucleic acid which is differentially expressed in 
the sample isolated from the patient compared to the reference library or the 
reference sample, and 

4. Matching the at least one nucleic acid identified in step 3 with at least one 
nucleic acid differentially expressed in a pathologic reference sample or 
pathologic reference library, 

wherein the matched nucleic acid(s) are indicative of the patient suffering from a 
liver disorder, and/or other epithelial cancer. 

Compared to the state of the art, this method of diagnosing surprisingly allows 
improved, more sensitive, earlier, faster, and/or non-invasive diagnosis of the liver 
disorders and/or other epithelial cancers. 

Preferably at least 2, at least 3, at least 4 at least 5, at least 6, or at least 7 nucleic 
acids are identified. 

In another preferred embodiment of the method the at least one nucleic acid is de- 
tected by PCR based detection or by a hybridization assay. 
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In another preferred embodiment of the method the expression of the at least one 
nucleic acid is compared by a method selected from the group consisting of so- 
lid-phase based screening methods, hybridization, subtractive hybridization, diffe- 
rential display, and RNase protection assay. 

In a further preferred embodiment of the method the sample isolated from the pati- 
ent is selected from the group consisting of liver tissue, a liver cell, tissue from 
another organ subject to cancerous transfonnation, a cell from this organ, blood, 
serum, plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, 
semen, and feces. - • 

Preferably the reference sample is isolated from a source selected from a 
non-diseased sample of the same patient or a non-diseased sample from another 
subject. The selection of appropriate reference samples is generally known to the 
person skilled in the art. In particular the reference sample may be selected from the 
group consisting of liver tissue, a liver cell, blood, serum, plasma, ascitic fluid, 
pleural effusion, cerebral spinal fluid, saliva, urine, semen, and feces. 

t. 

In another preferred embodiment of the method of diagnosis, the reference library 
is an expression library or a data base containing clones or data on non-diseased 
expression of the at least one nucleic acid according to the invention in samples that 
preferably may be selected from the group consisting of liver tissue, a liver cell, 
blood, serum, plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, 
urine, semen, and feces. 

In another preferred embodiment of the method of diagnosis, the pathologic refe- 
rence sample is isolated from a diseased sample from another patient. The latter 
patient having been diagnosed as suffering from the disorder according to the in- 
vention which is to be diagnosed. The selection of appropriate pathologic reference 
samples is generally known to the person skilled in the art. In particular the patho- 
logic reference sample may be selected from the group consisting of liver tissue, a 
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liver cell, blood, serum, plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, 
saliva, urine, semen, and feces. 

In another preferred embodiment of the method of diagnosis, the pathologic refe- 
rence library is a data base containing data on differential expression of the at least 
one nucleic acid according to the invention in samples isolated from at least one 
patient, excluding the patient under diagnosis, suffering from the disorder according 
to the invention to be diagnosed in the inventive method relative to control expres- 
sion in a reference sample or reference library. The pathologic reference library 
preferably also relates to a differential expression library containing nucleic acids 
according to the invention which are differentially expressed in samples isolated 
from at least one patient, excluding the patient under diagnosis, suffering from the 
disorder according to the invention to be diagnosed in the inventive method relative 
to control expression in a reference sample or reference library. The selection of an 
appropriate pathologic reference library is generally known to the person skilled in 
the art 

Preferably the liver disorder, is a disorder selected from the group consisting of 
cirrhosis, alcoholic liver disease, chronic hepatitis, Wilson's Disease, heamochro- 
matosis, hepatocellular carcinoma, benign liver neoplasms, and focal nodular hy- 
perplasia. In particular the epithelial cancer is an adenocarcinoma of any organ 
other than liver, preferably of an organ selected from the group consisting of the 
lung, die stomach, the kidney, the colon, the prostate, the skin, and the breast. 

Within the meaning of the invention the term "detecting a nucleic acid" refers to a 
method that preferably uncovers, visualizes, separates or allows recognition of the 
nucleic acid according to the invention from the background of the other compo- 
nents present in the sample. Such methods are generally known to the person skil- 
led in the art and include in situ hybridization, PCR amplication, gel electropho- 
resis, northern blots, solid phase array (gene chips) based methods, nuclease pro- 
tection methods (as described and referenced in Alberts, et al. (2002) The Molecu- 
lar Biology of the Cell, 4 th ed. Garland, New York, USA). 
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Withinthe meaning of the invention the term "comparing the expression of at least 
one nucleic acid detected in step 1 with the expression of the same at least one 
nucleic acid in a reference library or in a reference sample" refers to a comparison 
of the expression of the two groups of at least one nucleic acid on a quantitative or 
qualitative level by means of an experimental procedure such as differential dis- 
play, subtractive hybridization, RNAse protection assay, or especially DNA chip 
hybridization. Moreover a comparison of experimental data on the at least one 
nucleic acid detected in step 1 with the expression of the nucleic acids in a refe- 
rence library as defined above is also included herein. 

The term "Identifying this at least one nucleic acid which is differentially ex- 
pressed in the sample isolated from the patient compared to the reference library or 
the reference sample" within the meaning of the present invention is understood to 
mean selecting mis at least one nucleic acid which is differentially expressed com- 
pared to the reference library or the reference samples which fulfills the following 
criteria: the level of differential expression of the detected at least one nucleic acid 
compared to the reference library or the reference samples is greater than about 2 
fold, preferably greater than about 5 fold, more preferred greater than about 10 fold 
upregulated. 

The term "matching the at least one nucleic acid identified in step 3 with at least 
one nucleic acid differentially expressed in a pathologic reference sample or pa- 
thologic reference library " within the meaning of the invention is understood to 
mean that this at least one nucleic acid identified in step 3 is compared with at least 
one nucleic acid differentially expressed in a pathologic reference sample or pa- 
thologic reference library. Then this the at least one nucleic acid identified in step 3 
that is also differentially expressed in the pathologic reference sample or patholo- 
gic reference library is matched, i.e. at least one identical pair is identified and al- 
located. Since the differential expression of the at least one nucleic acid in the pa- 
thologic reference sample or pathologic reference library is indicative of a disorder 
according to the invention, such correspondence with the differential expression in 
the sample then indicates that the patient suffers from that disorder. 
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Preferably the sample is isolated from a patient by non-invasive or preferably mi- 
nimally invasive methods such as described above, including venupuncture. . 

The methods of diagnosing according to the invention allows early detection of a 
liver disorder and/or an epitherlial cancer, and/or non-invasive diagnosis of the 
disorder, based on an essentially concordant expression pattern of the nucleic acids 
according to the invention detected in the samples isolated from an animal and/or a 
human patient suffering from a liver disorder and/or an epithelial cancer relative to 
a reference sample or relative to a reference library. The method has the additional 
advantage that it also provides additional and novel diagnostic parameters to cha- 
racterize different subtypes of liver disorders, such as for example subtypes of 
HCC. 

The term "essentially concordant expression pattern" of the nucleic acids according 
to the invention refers to a pattern of expression that is essentially reproducible 
from patient to patient or subject to subject, provided that the patients or subjects 
compared are in the same or comparable pathological condition or healthy conditi- 
on, respectively. 

In still another aspect of the invention it is provided a method for identifying at 
least one polypeptide according to SEQ ID No. 1 to SEQ ID No. 9 or a functional 
variant thereof differentially expressed in a sample isolated from a patient relative 
to a reference library or a reference sample comprising the following steps: 

1. Detecting the expression of at least one polypeptide according to SEQ ID 
No. 1 to SEQ ID No. 9 or a functional variant thereof in a sample isolated 
from a patient, 

2. Comparing the expression of at least one polypeptide detected in step 1 with 
the expression of the same at least one polypeptide in a reference library or 
in a reference sample, 
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3- Identifying this at least one polypeptide which is differentially expressed in 
the sample isolated from the patient compared to the reference library or the 
reference sample. 

Compared to the state of the art, this method surprisingly allows improved, more 
sensitive, earlier, faster, and/or non-invasive identification of differentially ex- 
pressed polypeptides according to the invention which provides a useful basis for 
diagnosing a disorder according to the invention. 

Preferably at least 2, at least 3, at least 4, at least 5, at least 6, or at least 7 polypep- 
tides are identified. 

Preferably the sample is isolated from a patient by non-invasive or minimally inva- 
sive methods such as described above, including venupuncture. 

In another embodiment of the method the sample is a sample as defined further 
above. Preferably the reference sample is a reference sample as defined above. 

In another preferred embodiment of the method, the reference library is an expres- 
sion library or a data base containing clones or data on non-diseased expression of 
the at least one polypeptide according to the invention in samples that preferably 
may be selected from the group consisting of liver tissue, a liver cell, blood, serum, 
plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, semen, 
and feces. Such data bases are generated as a result of die cDNA microarray ex- 
pression analysis according to the invention and are known to persons skilled in the 
art. Further reference libraries useable according to the invention have been descri- 
bed above. 

In another aspect of the invention it is provided a method of diagnosing a liver di- 
sorder and/or an epithelial cancer comprising the following steps: 
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1. Detecting the expression of at least one polypeptide according to SEQ DO 
No. 1 to SEQ ID No. 9 or a functional variant thereof in a sample isolated 
from a patient, 

2. Comparing the expression of at least one polypeptide detected in step 1 with 
the expression of the same at least one polypeptide in a reference library or 
in a reference sample, 

3. Identifying this at least one polypeptide which is differentially expressed in 
the sample isolated from the patient compared to the reference library or the 
reference sample, and 

4. Matching the at least one polypeptide identified in step 3 with at least one 
polypeptide differentially expressed in a pathologic reference sample or 
pathologic reference library, 

wherein the matched polypeptide(s) are indicative of the patient suffering from a 
liver disorder and/or an epithelial cancer. 

Compared to the state of the art, this method of diagnosing surprisingly allows 
improved, more sensitive, earlier, faster, and/or non-invasive diagnosis of the liver 
disorders and/or other epithelial cancers. 

Preferably at least 2, at least 3, at least 4, at least 5, at least 6, or at least 7 polypep- 
tides are identified. 

Within the meaning of the invention the term "detecting a polypeptide" refers to a 
method that preferably uncovers, visualizes, separates and/or allows recognition of 
the polypeptide according to the invention from the background of the other com- 
ponents present in the sample. Such methods are generally known to the person 
skilled in the art and includes gel electrophoresis, chromatographic techniques, 
immunoblot analysis, immunohistochemistry, enzyme based immunoassay, mass 
spectroscopy, high pressure liquid chromatography, surface plasmon resonance, 
and/or antibody and protein arrays as described above (Ausubel, F.A. et al., eds., 
1990, Current Protocols in Molecular Biology. Greene Publishing and Wiley- 
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Interscience, New York, USA, Chapter 10; Myszka and Rich 2000, Pharm. Sci. 
TechnoL Today 3:310-317)- Preferably proteins and polypeptides are prepared from 
the sample by disruption of the cells with physical sheering or ultrasonic means, for 
example. Protein is denatured and stabilized with reducing agent treatment and 
heating and the protein is size fractionated on electrophoretic polyacrylamide gels. 

Within the meaning of the invention the term "comparing the expression of at least 
one polypeptide detected in step 1 with the expression of the same at least one po- 
lypeptide in a reference library or in a reference sample " refers to a comparison of 
the expression of the two groups of at least one polypeptide on a quantitative 
and/or qualitative level by means of an experimental procedure such as two dimen- 
sional gel electrophoresis, chromatographic separation techniques, immunoblot 
analysis, surface plasmon resonance, immunohistochemistry, and enzyme based 
immunoassay. In two dimensional gel electrophoresis all peptides are first resolved 
according to isoelectric point in the first electrophoretic dimension and then by size 
according to methods well known to persons experienced in the art. Moreover a 
comparison of experimental data on the at least one polypeptide detected in step 1 
with the expression of the polypeptide in a reference library as defined above is 
also included herein. 

The term "Identifying this at least one polypeptide which is differentially expressed 
in the sample isolated from the patient compared to the reference library or the 
reference sample" within the meaning of the present invention is understood to 
mean selecting this at least one polypeptide which is differentially expressed com- 
pared to the reference library or the reference samples which fulfills the following 
criteria: the level. of differential expression of the detected at least one polypeptide 
compared to the reference library or the reference samples is greater than about 2 
fold, preferably greater than about 5 fold, more preferred greater than about 10 fold 
upregulated. 

The term "matching the at least one polypeptide identified in step 3 with at least 
one polypeptide differentially expressed in a pathologic reference sample or patho- 
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logic reference library n within the meaning of the invention is understood to mean 
that this at least one polypeptide identified in step 3 is compared with at least one 
polypeptide differentially expressed in a pathologic reference sample or pathologic 
reference library. Then this the at least one polypeptide identified in step 3 that is 
also differentially expressed in the pathologic reference sample or pathologic refe- 
rence library is matched, i.e. at least one identical pair is identified and allocated. 
Since the differential expression of the at least one polypeptide in the pathologic 
reference sample or pathologic reference library is indicative of a disorder accor- 
ding to the invention, such correspondence with the differential expression in the 
sample then indicates that the patient suffers from that disorder. 

Preferably the sample is isolated from a patient by non-invasive or minimally inva- 
sive methods such as described above, including venupuncture. 

La another embodiment of the method the sample is a sample as defined further 
above. Preferably the reference sample is a reference sample as defined above. 

In another preferred embodiment of the method of diagnosis, the reference library 
is an expression library or a database containing clones or data on non-diseased 
expression of the at least one polypeptide according to the invention in samples that 
preferably may be selected from the group consisting of liver tissue, a liver cell, 
blood, serum, plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, 
urine, semen, and feces. 

An example of a data base according to the invention and further experimental 
reference libraries useable according to the invention have been described above. 

In another preferred embodiment of the method of diagnosis, the pathologic refe- 
rence sample is a pathologic reference sample as has been defined above. 
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In another preferred embodiment of the method of diagnosis, the pathologic refe- 
rence library is a data base containing data on differential expression of the at least 
one polypeptide according to the invention in samples isolated from at least one 
patient, excluding the patient under diagnosis, suffering from the disorder according 
to the invention to be diagnosed in the inventive method relative to control expres- 
sion in a reference sample or reference library. The pathologic reference library 
also relates to a differential expression library containing polypeptides according to 
the invention which are differentially expressed in samples isolated from at least 
one patient, excluding the patient under diagnosis, suffering from the disorder ac- 
cording to the invention to be diagnosed in the inventive method relative to control 
expression in a reference sample or reference library. The selection of an appro- 
priate pathologic reference library is generally known to the person skilled in the 
art. 

Preferably the liver disorder, is a disorder selected from the group consisting of 
cirrhosis, alcoholic liver disease, chronic hepatitis, Wilson's Disease, heamochro- 
matosis, hepatocellular carcinoma, benign liver neoplasms, and focal nodular hy- 
perplasia. In particular the epithelial cancer is an adenocarcinoma of any organ 
other than liver, preferably of an organ selected from the group consisting of the 
lung, the stomach, the kidney, the coloa, the prostate, the skin, and the breast ■ 

The methods of diagnosing according to the invention allows early detection of a 
liver disorder and/or epithelial cancer, and/or non-invasive diagnosis of the disor- 
der, based on an essentially concordant expression pattern of the polypeptides ac- 
cording to the invention detected in the samples isolated from an animal and/or a 
human patient suffering from* a liver disorder and/or epithelial cancer relative to a 
reference sample or relative to a reference library. The method has the additional 
advantage that it also provides additional and novel diagnostic parameters to cha- 
racterize different subtypes of liver disorders , such as for example subtypes of 
HCC. 
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The term "essentially concordant expression pattern" of the polypeptides according 
to the invention refers to a pattern of expression that is essentially reproducible 
from patient to patient or subject to subject, provided that the patients or subjects 
compared are in the same or comparable pathological condition or healthy conditi- 
on, respectively. 

In another aspect of the invention it is provided a pharmaceutical composition 
containing at least one compound selected from the group consisting of a polypep- 
tide according to the invention, a functional variant thereof, a nucleic acid encoding 
the polypeptide, a nucleic acid according to SEQ ID No. 19, a variant of one of the 
aforementioned nucleic acids, a nucleic acid which is a non-functional mutant vari- 
ant of one of the aforementioned nucleic acids, a nucleic acid having a sequence 
complementary to one of the aforementioned nucleic acids, a vector containing one 
of the aforementioned nucleic acids, a cell containing one of the aforementioned 
nucleic acids, a cell containing the vector, an antibody directed against the poly- 
peptide, a fragment of the antibody, a vector containing a nucleic acid coding for 
the antibody, a cell containing the vector containing a nucleic acid coding for the 
antibody, and a cell containing the vector containing a nucleic acid coding for the 
antibody fragment, combined or together with suitable additives or auxiliaries. In a 
preferred embodiment the pharmaceutical composition contains at least one cell ' - 
according to the invention, combined or together with suitable additives or auxilia- 
ries. 

When compared to the state of the art of therapy of liver disorders, and/or other 
epithelial cancers the pharmaceutical composition according to the invention sur- 
prisingly provide an improved, sustained and/or more effective treatment. 

A pharmaceutical composition in the sense of the invention encompasses medica- 
ments which can be used for preventing and/or treating a liver disorders and/or 
epithelial cancer. The pharmaceutical composition includes, for instance, a stabili- 
zed recombinant antibody that has been produced by expression of specific antibo- 
dy gene fragments in a cellular system, preferably a eukaryotic system. A recombi- 
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nant antibody therapeutic for instance, is deUvered by injection into the diseased 
liver region or into the venous or arterial vascular systems or into the hepatic portal 
system. The injections can be repeated at regular intervals to achieve therapeutic 
efficacy. Therapeutics according this invention may also be employed in combina- 
tions with other chemical, antibody, or any other therapeutic application to improve 
efficacy. 

The present invention also relates to a process producing a pharmaceutical compo- 
sition for the treatment and/or prevention of disorders according to the invention, 
for example, HCC, in which at least one component selected from the group con- 
sisting of a polypeptide according to the invention, a functional variant thereof, a 
nucleic acid encoding the polypeptide or a nucleic acid according to SEQ ID No. 
19, a variant of one of the aforementioned nucleic acids, a nucleic acid which is a 
non-functional mutant variant of one of the aforementioned nucleic acids, a nucleic 
acid having a sequence complementary to one of the aforementioned nucleic acids, 
a vector containing one of the aforementioned nucleic acids, a cell containing one 
of the aforementioned nucleic acids, a cell containing the vector, an antibody di- 
rected against the polypeptide, a fragment of the antibody, a vector containing a 
nucleic acid coding for the antibody, a cell containing the vector containing a nu- 
cleic acid coding for the antibody, and a cell containing the vector containing a 
nucleic acid coding for the antibody fragment, is combined or together with suit- 
able additives. 

The present invention furthermore relates to a pharmaceutical composition produ- 
ced by this process for the treatment and/or prevention of liver disorders and/or 
epithelial cancers, for example, HCC, which contains at least one component se- 
lected from the group consisting of a polypeptide according to the invention, a 
functional variant thereof, a nucleic acid encoding the polypeptide or a nucleic acid 
according to SEQ ID No. 19, a variant of one of the aforementioned nucleic acids, 
a nucleic acid which is a non-functional mutant variant of one of the aforemen- 
tioned nucleic acids, a nucleic acid having a sequence complementary to one of the 
aforementioned nucleic acids, a vector containing one of the aforementioned 
nucleic acids, a cell containing one of the aforementioned nucleic acids, a cell 
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containing the vector, an antibody directed against the polypeptide, a fragment of 
the antibody, a vector containing a nucleic acid coding for the antibody, a cell 
containing the vector containing a nucleic acid coding for the antibody, and a cell 
containing the vector containing a nucleic acid coding for the antibody fragment, , 
if appropriate together with suitable additives and auxiliaries. The invention 
furthermore relates to the use of this pharmaceutical composition for the prevention 
and/or treatment of liver disorders, for example, HCC and/or epithelial cancer. 

Preferably the pharmaceutical composition is employed for the treatment of a liver 
disorder selected from the group consisting of cirrhosis, alcoholic liver disease, 
chronic hepatitis, Wilson's Disease, heamochromatosis, hepatocellular carcinoma, 
benign liver neoplasms, and focal nodular hyperplasia. In particular the phrama- 
ceutical composition is employed for the treatment of an epithelial cancer which is 
an adenocarcinoma of any organ other than liver, preferably of an organ selected 
from the group consisting of the lung, the stomach, the kidney, the colon, the pro- 
state, the skin, and the breast 

Therapy can also be carried out in a conventional manner generally known to the 
person skilled in the art, e.g. by means of oral application or via intravenous injec- 
tion of the pharmaceutical compositions according to the invention. It is thus pos- 
sible to administer the pharmaceutical composition containing the suitable additi- 
ves or auxiliaries, such as, for example, physiological saline solution, deminerali- 
zed water, stabilizers, proteinase inhibitors. 

A therapy based on the use of cells, which express at least one polypeptide ac- 
cording to the invention, of functional variants thereof or nucleic acids coding for 
the polypeptide or a nucleic acid according to the SEQ ID No. 19, or variants 
thereof can be achieved by using autologous or heterologous cells. Preferred cells 
comprise liver cells, for example primary cultures of liver cells, liver populating 
stem or progenitor cells, or blood cells. The cells can be applied to the tissue, pref- 
erably to the blood or injected into the liver, with suitable carrier material. Such 
therapy is preferably based on the notion that upon expression and/or release of a 
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polypeptide according to the invention the polypeptide stimulates an immune re- 
sponse in the patient in need of the treatment. 

Preferably the therapeutical approach is directed toward inhibiting the function 
and/or expression of at least one polypeptides according to the invention and/or the 
function and/or expression of at least one nucleic acids according to the invention. 
Such inhibition of the expression and/or function preferably reduces the expression 
and/or function of the targeted nucleic acid/polypeptide significantly. The inhibi- 
tion of the expression and/or function preferably abolishes the expression and/or 
functioning of Jhe„ targeted nucleiq acid/polypeptide. Such reduction, or abolished 
expression and/or fimctioning of the targeted nucleic acid/polypeptide can be de- 
termined using conventional assays for detennining the expression and/or func- 
tioning of a polypeptide/nucleic acid generally known to the person skilled in the 
art. In particular such assays for determining the function comprise methods for 
comparing the biological activity of the targeted nucleic acid/polypeptide before 
and after administration of the pharmaceutical composition. Preferably such assays 
for determining the expression comprise methods for comparing the level of ex- 
pression of the targeted nucleic acid/polypeptide before and after administration of 
the pharmaceutical composition. 

Such therapy is preferably accomplished by the use of a nucleic acid having a se- 
quence complementary to one of nucleic acids according to the invention, i.e. an 
antisense molecule or a RNA interference molecule which reduces or abolishes the 
translation of transcribed nucleic acids according to the invention and thereby in- 
hibits the function and/or expression of the targeted nucleic acid/polypeptide. Pref- 
erably such nucleic -acid, having a complementary sequence may be employed in 
the form of a vector or a cell containing such nucleic acid. On the polypeptide level 
the therapy may in particular be carried out by the use of an antibody or an anti- 
body fragment directed against a polypeptide according to the invention. The anti- 
body or antibody fragment may be administered directly to the patient or preferably 
the nucleic acid encoding the antibody is contained in a vector which is preferably 
contained in a cell. The cell or vector may then be administered to the patient in 
need of such treatment. 
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When compared to the state of the art of therapy of liver disorders, and/or other 
epithelial cancers the method of treating according to the invention surprisingly 
provide an improved, sustained and/or more effective treatment 

The invention further relates to a method of treating a patient suffering from of a 
liver disorder, wherein at least one component selected from the group consisting 
of a polypeptide according to the invention, a functional variant thereof, a nucleic 
acid encoding the polypeptide or a nucleic acid according to SEQ ID No. 19, a 
variant of one of the aforementioned nucleic acids, a nucleic acid which is a non- 
functional mutant variant of one of the aforementioned nucleic acids, a nucleic acid 
having a sequence complementary to one of the aforementioned nucleic acids, a 
vector containing one of the aforementioned nucleic acids, a cell containing one of 
the aforementioned nucleic acids, a cell containing the vector, an antibody directed 
against the polypeptide, a fragment of the antibody, a vector containing a nucleic 
acid coding for the antibody, a cell containing the vector containing a nucleic acid 
coding for the antibody, and a cell containing the vector containing a nucleic acid 
coding for the antibody fragment, optionally combined or together with suitable 
additives and/or auxilaries, is administered to the patient in need of a the treatment 
in a therapeutically effective amount. 

Preferably the method of treatment is directed to a liver disorder selected from the . 
group consisting of cirrhosis, alcoholic liver disease, chronic hepatitis, Wilson's 
Disease, heamochromatosis, hepatocellular carcinoma, benign liver neoplasms, and 
focal nodular hyperplasia. In particular the method of treatment is directed to an 
epithelial cancer which is an adenocarcinoma of any organ other than liver, pref- 
erably of an organ selected from the group consisting of the lung, the stomach, the 
kidney, the colon, the prostate, the skin, and the breast. 

Methods of administering such compounds or cells have been described in detail 
above. 
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The term "therapeutically effective amount" refers to the administration of an 
amount of the compound to the patient that results in an "effective treatment" as 
defined above. Determination of the therapeutically effective amount of the com- 
pound(s) is generally known to the person skilled in the art. 

Such methods of treating allow effective treatment of a liver disorder and/or epit- 
helial cancers as described above. 

In another aspect of the invention it is provided a method of stimulating an immune 
response a patient suffering from a liver disorder and/or an epithelial cancer to a 
polypeptide according to the invention or a functional variant thereof, wherein at 
least one component selected from the group consisting of a polypeptide according 
to the invention, a functional variant thereof, a nucleic acid encoding the polypep- 
tide, a nucleic acid coding for the SEQ ID No. 19, a variant of one of the afore- 
mentioned nucleic acids, a vector containing one of the aforementioned nucleic 
acids, a cell containing one of the aforementioned nucleic acids, and a cell contai- 
ning the vector, is administered to the patient in need of such treatment in an 
amount effective to stimulate the immune response in the patient 

When compared to the state of the art of therapy of liver disorders, and/or other 
epithelial cancers the method of stimulating an immune response according to the 
invention surprisingly provide an improved, sustained and/or more effective im- 
munization. 

In another aspect of the invention it is provided a method of preventing a patient 
from developing a liver disorder and/or an epithelial cancer, wherein at least one 
component selected from the group consisting of a polypeptide according to the 
invention, a functional variant thereof, a nucleic acid encoding the polypeptide, a 
nucleic acid coding for the SEQ ID No. 19, a variant of one of the aforementioned 
nucleic acids, a nucleic acid having a sequence complementary to one of the afo- 
rementioned nucleic acids, a nucleic acid which is a non-functional mutant variant 
of one of the aforementioned nucleic acids, a vector containing one of the afore- 
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mentioned nucleic acids, a cell containing one of the aforementioned nucleic acids, 
and a cell containing the vector, is administered to the patient in need of such pre- 
ventive treatment in a therapeutically effective amount 

When compared to the state of the art of therapy of liver disorders, and/or other 
epithelial cancers the method of preventing according to the invention surprisingly 
provide an improved, sustained and/or more effective preventive measure. 

Preferably the method of preventing and/or method of stimulating an immune re- 
sponse is directed to a liver disorder selected from the group consisting of cirrhosis, 
alcoholic liver disease, chronic hepatitis, Wilson's Disease, heamochromatosis, 
hepatocellular carcinoma, benign liver neoplasms, and focal nodular hyperplasia. 
In particular, preferably the method of preventing and/or method of stimulating an 
immune response is directed to an epithelial cancer which is an adenocarcinoma of 
any organ other than liver, preferably of an organ selected from the group consi- 
sting of the lung, the stomach, the kidney, the colon, the prostate, the skin, and the 
breast. 

In a further aspect the invention relates to a method of identifying at least one 
pharmacologically active compound comprising the following steps: 

1 . providing at least one polypeptide according to the SEQ ID No. 2 or a func- 
tional variant thereof, 

2. contacting the at least one polypeptide, with at least one putative pharma- 
cologically active compound, 

3. assaying the interaction of the at least one putative pharmacologically active 
compound and the at least one polypeptide, 

4. identifying the at least one pharmacologically active compound among the 
assayed at least one putative pharmacologically active compound by selec- 
ting those assayed at least one putative pharmacologically active compound 
which directly or indirectly functionally interact with the at least one poly- 
peptide. 
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Preferably the at least one polypeptide is provided in a form selected from the 
group of the at least one polypeptide is attached to a column, the at least one poly- 
peptide is attached to an array, the at least one polypeptide is is contained in an 
electrophoresis gel, the at least one polypeptide is attached to a membrane, and the 
at least one polypeptide is expressed by a cell. 

It is preferred to assay the interaction by a method selected from the group of en- 
zyme and fluorescence based cellular reporter assays in which interaction of the at 
least one pharmacological substance with a recombinant fusion protein including 
the at least one polypeptide is detected. The interaction may preferably also be 
assayed by surface plasmon resonance, HPLC and mass spectroscopy. Preferably 
the direct or indirect functional interaction is selected from the group consisting of 
induction of the expression of the at least one polypeptide, inhibition of the expres- 
sion of the at least one polypeptide, activation of the function of the the at least one 
polypeptide, inhibition of the function of the the at least one polypeptide. 

The term "pharmacologically active substance" in the sense of the present inventi- 
on is understood as meaning all those molecules, compounds and/or compositions 
and substance mixtures which can interact under suitable conditions with the poly- 
peptide according to the SEQ ID No. 2 or functional variants thereof; if appropriate 
together with suitable additives and/or auxiliaries. Possible pharmacologicaUy ac- 
tive substances are simple chemical (organic or inorganic) molecules or com- 
pounds, but can also include peptides, proteins or complexes thereof. Examples of 
pharmacologically active substances are organic molecules that are derived from 
libraries of compounds that nave been analyzed for their pharmacological activity. 
On account of their interaction, the pharmacologically active substances can influ- 
ence the expression and/or functions) of the polypeptide in vivo or in vitro or al- 
ternatively only bind to the polypeptides described above or enter into other inter- 
actions of covalent or non-covalent manner with them 
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A suitable test system which can be used in accordance with the invention is based 
on identifying interactions with the two hybrid system (Fields and Sternglanz, 
1994, Trends in Genetics, 10, 286-292; Colas and Brent, 1998 HBTECH, 16, 355- 
363). In this test system, cells are transformed with expression vectors which ex- 
press fusion proteins which consist of at least one polypeptide according to the 
invention and a DNA-binding domain of a transcription factor such as Gal4 or 
LexA. The transformed cells also contain a reporter gene whose promoter contains 
binding sites for the corresponding DNA-binding domain. By means of transfor- 
ming a further expression vector, which expresses a second fusion protein consi- 
sting of a known or unknown polypeptide and an activation domain, for example 
from Gal4 or herpes simplex virus VP16, the expression of the reporter gene can 
be greatly increased if the second fusion protein interacts with the investigated 
polypeptide according to the invention. This increase in expression can be used for 
identifying new interacting partners, for example by preparing a cDNA library 
from e.g., liver tissue, or diseased liver tissue for the purpose of constructing the 
second fusion protein. In a preferred embodiment, the interaction partner is an in- 
hibitor of the polypeptide according to the SEQ ID No. 2 or functional variants 
thereof. This test system can also be used for screening substances which inhibit an 
interaction between the polypeptide according to the invention and an interacting 
partner. Such substances decrease the expression of the reporter gene in cells 
which are expressing fusion proteins of the polypeptide according to the invention 
and the interacting partner (Vidal and Endoh, 1999, Trends in Biotechnology, 17: 
374-81). In this way, it is possible to rapidly identify novel active compounds 
which can be employed for the therapy of and/or prevention of liver disorders 
and/or epithelial cancer. 



Assays for identifying pharmacologically active substances which exert an influ- 
ence on the expression of proteins are well known to the skilled person (see, for 
example, Sivaraja et al., 2001, US 6,183,956). Thus, cells which express the poly- 
peptide according to the SEQ ID No. 2 or functional variants thereof, can be cultu- 
red as a test system for analyzing gene expression in vitro, with preference being 
given to liver cells. Gene expression is analyzed, for example, at the level of the 
mRNA or of the proteins using methods generally known to the person skilled in 
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the art. In this connection, the quantity of polypeptide according to the SEQ ID No. 
2 or mRNA present after adding one or more putative pharmacologically active 
substances to the cell culture is measured and compared with the corresponding 
quantity in a control culture. This is done, for example, with the aid of an antibody 
specifically directed against the polypeptide according to the SEQ ID No. 2 or a 
functional variant thereof, which can be used to detect the polypeptide present in 
the lysate of the cells. The amount of expressed polypeptide can be quantified by 
methods generally known to the person skilled in the art using, for example, an 
EUSA or a Western blot. In this connection, it is possible to carry out the analysis 
as a high-throughput method and to analyze a very large number of substances for 
their suitability as modulators of the expression of the polypeptide according to the 
SEQ ID No. 2 (Sivaraja et al., 2001, US 6,183,956). In this connection, the sub- 
stances to be analyzed can be taken from substance libraries (see, e.g. 
DEI 9816414, DE19619373) which can contain several thousand substances which 
are frequently very heterogeneous. 

The invention will now be further illustrated below with the aid of the figures and 
examples, representing preferred embodiments and features of the invention with- 
out the invention being restricted hereto. 

inscri ption of the tables, fig ures and sequences: 

Figure 1: nucleic acids and polypeptides according to 

the invention and their respective SEQ ID 
Nos. and Accession numbers from the Gen- 
Bank. 



Figure 2: 



depicts the OBcll.pr polypeptide sequence of 
SEQ ID No. 1. 



Figure 3: 



Figure 4: 



Figure 5: 



Figure 6: 



Figure 7: 



Figure 8: 



Figure 9: 



Figure 10: 



Figure 11: 
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depicts the OBclS.pr polypeptide sequence of 
SEQ ID No, 2. 



depicts the DC2.pr polypeptide sequence of 
SEQ ID No. 3. 



depicts the IKS.pr polypeptide sequence of 
SEQ ID No. 4. 



depicts the DAP3.pr polypeptide sequence of 
SEQ ID No. 5. 



depicts the LOCS.pr polypeptide sequence of 
SEQ ID No. 6. 



depicts the SEC 1 4L2.pr polypeptide sequence 
of SEQ ID No. 7. 



depicts the SSP29.pr polypeptide sequence of 
SEQ ID No. 8. 



depicts the HS16.pr polypeptide sequence of 
SEQ ID No. 9. 



depicts proposed full length OBcll cDNA 
sequence (SEQ ID 10), sequence from dere- 
gulated SSH cDNA clone given in small ca- 
pital letters. Sequence compiled includes 
GenBank entries AL050205, 2312bp - 3' end 
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(including SSH sequence); AF131755, 
2135bp ; XM_1 13703 2130bp (w/open rea- 
ding frame (ORF); AY004310 1615bp (also 
w/ORF). 

depicts the OBcl5 nucleic acid sequence of 
SEQ ID No. 11. 

depicts the IK2 nucleic acid sequence of SEQ 
ID No. 12. 

depicts the IK5 nucleic acid sequence of SEQ 
ID No. 13. 

depicts the DAP3 nucleic acid sequence of 
SEQ ID No. 14. 

depicts die proposed full length LOC5 mRNA 
sequence for deregulated cDNA clone (SEQ 
ID 15). The sequence corresponds to Gen- 
Bank Accession no. NM_0 18447. 

depicts the SEC14L2 nucleic acid sequence 
of SEQ ID No. 16 

depicts the SSP29 nucleic acid sequence of 
SEQ ID No. 17 



depicts the HS16 nucleic acid sequence of 
SEQ ID No. 18 



depicts the IK3 nucleic acid sequence of SEQ 
ID No. 19 

cDNA microarray nucleic acid expression 
level ratios. The expression levels relative to 
a non-diseased liver reference sample of se- 
quences according to the invention assessed 
in tissues from human liver disorders, inclu- 
ding HCC. The values are ratios of expressi- 
on levels between diseased and non-diseased 
liver samples, obtained from competitive hy- 
bridization to custom-made cDNA micro- 
arrays. HCC = hepatocellular carcinoma 
samples; FNH = focal nodular hyperplasia 
samples; Cirrh = cirrhosis samples; Copper 
tox. = cirrhosis with copper toxicity etiology; 
Adenoma « adenoma; Non-dis. liver - Non- 
diseased liver; mean = average of ratio values 
for each group (HCC, FNH and Ciirh). 

Summary of cDNA microarray nucleic acid 
expression values. The median expression - 
level value from experimental (diseased) and 
control (non-diseased) tissues for each 
nucleic acid according to the invention is 
provided. The Wilcoxon rank sum two-sided 
test was applied to these data (paired flag = 
'false') to demonstrate the difference between 
groups (Hollander & Wolfe, 1973, Nonpara- 
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metric statistical inference. New York: John 
Wiley & Sons, pgs. 27-33, 68-75; Bauer, 
D.F., 1972, J- Amen Statistical Assoc. 67: 
687-690). The expression values typically do 
not fit a normal distribution so averaging the 
values may be misleading. However, analysis 
of the median values demonstrates significant 
differences in most cases between experi- 
mental and reference values, particularly in 
the large data sets. Expt. median = median 
value for experimental (diseased) tissues; 
Expt. iqr = experimental value interquartile 
range (+/- 25% of median value); Contr. me- 
dian ■? median value for control (non- 
diseased) tissue samples; Contr. iqr - control 
value interquartile range (+/- 25% of median 
value); p value « value resulting from stati- 
stical evaluation of the probability that the 
experimental and control values are signifi- 
cantly different. Ninety-five percent confi- 
dence in difference significance provides a p 
value of 0.05. 

Summary boxplot of expression values in 
HCC versus non-diseased liver cDNA micro- 
array experiments. The box plot is a graphical 
representation of all expression value ratios 
with the median value indicated by a hori- 
zontal line. The extent of each box indicates 
the interquartile range (iqr, +/- 25% of the 
median value); error bars indicate 1 .5 times 
the iqr. Ratios that do not fall within this ran- 
ge are indicated as circles. For each nucleic 
acid according to the invention, elevated ex- 
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pression is apparent in HCC (at least 2 fold 
high than the non-diseased liver values; the y- 
axis indicates the fold induction of gene ex- 
pression in experimental vs. control tissues). 
Expression values are consistently elevated in 
a similar ratio except for OBcl5 (SEQ ID No. 
1 1) where the differences in expression bet- 
ween patient and control samples is most 
dramatically different 

Comparison of nucleic acid expression values 
in non-neoplastic liver diseases and liver can- 
cer. For each nucleic acid according to the 
invention a p value is provided for the diffe- 
rence in the median experimental expression 
values for comparisons between FNH, liver 
cirrhosis (Cirrh.) and HCC samples. For each 
nucleic acid and comparison a value of less 
than or equal to 0.05 indicates a significant 
difference in expression values between the 
disease groups. Significance was assessed 
with the Wilcoxon rank sum test. Statistically 
significant differences in expression are evi- 
dent between disease groups. For example, 
the expression values for IK2 are signifi- 
cantly different in all three comparisons (p 
values less than 0.05). The FNH sample 
group is small and displayed a large distribu- 
tion of values which likely accounts for fewer 
significant differences in comparisons with 
this group. 
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RT-PCR analysis of nucleic acid expression 
in human non-diseased and disease tissues. 
Reverse transcriptase polymerase chain reac- 
tion (RT-PCR) was performed with primers 
specific for each deregulated nucleic acid in 
each tissue listed. All tissues employed were 
diagnostically confirmed prior to utilization 
for RNA (and cDNA) preparation. The 
symbol indicates that the gene is expressed in 
the tissue, the "-" indicates that this gene was 
not detected in cDNA from this RNA sample; 
and a blank box indicates that the analysis 
was not performed for that gene and tissue 
combination. The patient's age and sex and 
provided. Additional sample information in- 
cludes the tumor staging value (T - tumor si- 
ze), as well as the tumor grading score (G = 
tumor cell differentiation); large numbers in- 
dicate larger and less well-differentiated tu- 
mors, respectively. The positive control for 
tissue cDNA is amplification from the gly- 
ceraldehyde phosphate dehydrogenase 
mRNA (GAPDH). 

Representative RT-PCR data demonstrating 
expression of the nucleic acids according to 
the invention in independent HCC samples 
and controls. Amplification of the •housekee- 
ping' gene GAPDH was included in parallel 
reactions with each cDNA template to control 
for cDNA quality. 5 to 10% of the RT-PCR 
reaction products subjected to 30-40 PCR cy- 
cles were loaded onto this agarose gel. Purl- 
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fied DNA from the HCC library pool is in- 
cluded as a positive control (C) for each 
nucleic acid according to the invention. Two 
independent HCC samples (H) were included 
in this analysis together with one non- 
diseased liver sample (N) for representative 
nucleic acids according to the invention. M = 
molecular mass marker (100 bp ladder). 

Figure 27: Verification of differential gene expression 

by RNA blot Independent evaluation of 
RNA samples from a pool of 3 non-diseased 
livers (L) and from 2 HCC tissues (H) veri- 
fies the increased expression of OBcll (SEQ 
ID No. 10) and OBcI5 (SEQ ID No. 1 1) in 
this image of an RNA blot autoradiogram as 
indicated on the figure. The results from the 
antisense strand probe (top; specific signal) 
and the corresponding sense strand probe ne- 
gative control (bottom) demonstrate the spe- 
cificity of hybridization with the antisense 
probe. 



SEQ ID No. 1 to SEQ ID No. 9 refer to the human sequences of polypeptide 

according to the invention from. 

SEQ ID No. 10 to SEQ ID No. 19 refer to the human cDNA sequences accor- 
ding to the invention. 



Further embodiments and preferred features of the invention are provided below. 
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An isolated polypeptide comprising a sequence according to SEQ ID No. 2 
or a functional variant thereof. 

A fusion protein, wherein the fusion protein contains a polypeptide accor- 
ding to the embodiment 1. 

An isolated nucleic acid encoding a polypeptide according to the embodi- 
ment 1 or a variant thereof. 

A nucleic acid according to the embodiment 3, wherein the nucleic acid is a 
single-stranded or double-stranded RNA. 

A nucleic acid according to the embodiment 3, wherein the nucleic acid is a 
nucleic acid according to SEQ ID No.l 1 . 

A vector, wherein the vector contains a nucleic acid selected from the group 
consisting of a nucleic acid according to the embodiment 3, a nucleic acid 
coding for a polypeptide according to the SEQ ID No. 1-9, a nucleic acid 
according to the SEQ ID No. 19. 

A vector according to the embodiment 6, wherein the vector is selected from 
the group consisting of a knock-out gene construct, a plasmid, a shuttle 
vector, a phagemid, a cosmid, a viral vector, and an expression vector. 

A cell, wherein the cell contains a nucleic acid according to the embodiment 
3. 

A cell, wherein the cell is transformed with a vector according to the embo- 
diment 6. 

A cell according to the embodiment 9, wherein the cell is a transgenic em- 
bryonic non-human stem cell. 
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11. A transgenic non-human mammal, wherein the transgenic non-human 
mammal contains the nucleic aicid according to the embodiment 3. 

12. An antibody or an antibody fragment thereof, wherein the antibody is di- 
rected against a polypeptide according to the embodiment 1 or against a 
nucleic acid coding for the polypeptide. 

13. A nucleic acid which comprises a nucleic acid having a sequence comple- 
mentary to a nucleic acid according to the embodiment 3 or a non-functional 
mutant variant of the nucleic acid according to the embodiment 3. 

14. A nucleic acid according to the embodiment 13, wherein nucleic acid having 
a complementary sequence is an antisense molecule or an RNA interference 
molecule. 

15. A vector, wherein the vector contains a nucleic acid according to the embo- 
diment 13. 

16. A vector according to the embodiment 15, wherein the vector is selected 
from the group consisting of a plasmid, a shuttle vector, a phagemid, a cos* 
mid, a viral vector, and an expression vector. 

17. A cell, wherein the cell contains a nucleic acid according to the embodiment 
13. 

18. A cell, wherein the cell is transformed with a vector according to the embo- 
diment 15. 

19. A diagnostic, wherein the diagnostic contains at least one compound selec- 
ted from the group consisting of a polypeptide according to the embodiment 
1, a polypeptide according to SEQ ID No. 1 to 9, a nucleic acid encoding 
one of the aforementioned polypeptides, a nucleic acid coding for the SEQ 
ID No. 19, a variant of one of the aforementioned nucleic acids, an antibody 
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directed against the polypeptide, and a fragment of the antibody, combined 
or together with suitable additives or auxiliaries.- • 

A diagnostic according to the embodiment 19, wherein the nucleic acid is a 
probe, preferentially a DNA probe. 

A pharmaceutical composition, wherein the pharmaceutical composition 
contains at least one component selected from the group consisting of a po- 
lypeptide according to the embodiment 1, a polypeptide according to SEQ 
ID No. 1 to 9, a functional variant of the polypeptide, a nucleic acid enco- 
ding one of the aforementioned polypeptides, a nucleic acid coding for the 
SEQ ID No. 19, a variant of one of the aforementioned nucleic acids, a 
nucleic acid which is a non-functional mutant variant of one of the afore- 
mentioned nucleic acids, a nucleic acid having a sequence complementary 
to one of the aforementioned nucleic acids, a vector containing one of the 
aforementioned nucleic acids, a cell containing one of the aforementioned 
nucleic acids, a cell containing the vector, an antibody directed against the 
polypeptide, a fragment of the antibody, a vector containing a nucleic acid 
coding for the antibody, a cell containing the vector containing a nucleic 
acid coding for the antibody, and a ceil containing the vector containing a 
nucleic acid coding for the antibody fragment, combined or together with 
suitable additives or auxiliaries. 

A pharmaceutical composition according to the embodiment 21, wherein the 
nucleic acid having a complementary sequence is an antisense molecule or 
an RNA interference molecule. 

A method of diagnosis of a liver disorder or an epithelial cancer, wherein at 
least one compound selected from the group consisting of a polypeptide ac- 
cording to the sequence of SEQ ID No. 1 to SEQ ID No. 9, a functional va- 
riant thereof, a nucleic acid encoding the polypeptide, a nucleic acid accor- 
ding to SEQ ID No. 19, a variant of one of the aforementioned nucleic 
acids, an antibody directed against the polypeptide, and a fragment of Hie 
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antibody, is identified in the sample of a patient and compared with at least 
one compound of a reference library or of a reference sample. 

24. A method according to one of the embodiment 23, wherein the liver disor- 
ders disorder, is a disorder selected from the group consisting of cirrhosis, 
alcoholic liver disease, chronic hepatitis, Wilson's Disease, heamochroma- 
tosis, hepatocellular carcinoma, benign liver neoplasms, and focal nodular 
hyperplasia. 

25. A method according to one of the embodiments 23, wherein the epithelial 
cancer is an adenocarcinoma of any organ other than liver, preferably of an 
organ selected from the group consisting of the lung, the stomach, the kid- 
ney, the colon, the prostate, the skin, and the breast. 

26. A method of treating a patient suffering from a liver disorder or an epithelial 
cancer, wherein at least one component selected from the group consisting 
of a polypeptide according SEQ ID No. 1-9, a functional variant of the po- 
lypeptide, a nucleic acid encoding the polypeptide, a nucleic acid encoding 
the functional variant, a nucleic acid coding for the SEQ ID No. 19, a vari- 
ant of one of the aforementioned nucleic acids, a nucleic acid which is a 
non-functional mutant variant of one of the aforementioned nucleic acids, a 
nucleic acid having a sequence complementary to one of the aforementioned 
nucleic acids, a vector containing one of the aforementioned nucleic acids, a 
cell containing one of the aforementioned nucleic acids, a cell containing the 
vector, an antibody directed against one of the aforementioned polypeptides, 
an antibody directed against a functional variant of one of the aforemen- 
tioned polypeptides, a fragment of one of the aforementioned antibodies, a 
vector containing a nucleic acid coding for one of the aforementioned anti- 
bodies, a vector containing a nucleic acid coding for one of the aforemen- 
tioned antibody fragments, a cell containing the vector containing a nucleic 
acid coding for one of the aforementioned antibodies, and a cell containing 
the vector containing a nucleic acid coding for one of the aforementioned 
antibody fragments, is administered to the patient in need of a the treatment 
in a therapeutically effective amount. 
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27. A method of treating according to the embodiment 26, wherein the nucleic 
acid having a complementary sequence is an antisense molecule or an RNA 
interference molecule. 

28. A method of treating according to the embodiment 27, wherein the RNA 
interference molecule is administered in the form of a double stranded RNA 
or a vector expressing the double stranded RNA 

29. The method according to the embodiment 28, wherein the RNA interference 
molecule has a size range selected from the group consisting of from 15 to 
30 nucleotides, from 20 - 25 nucleotides, and from 21-23 nucleotides. 

30. A method according to one of the embodiment 26, wherein the liver disor- 
der, is a.disorder selected from the group consisting of cirrhosis, alcoholic 
liver disease, chronic hepatitis, Wilson's Disease, heamochromatosis, he- 
patocellular carcinoma, benign liver neoplasms, and focal nodular hyperpla- 
sia. 

31., A method according to one of the embodiments 26, wherein the epithelial 
cancer is an adenocarcinoma of any organ other than liver, preferably of an 
organ selected from the group consisting of the lung, the stomach, the kid- 
ney, the colon, the prostate, the skin, and the breast. 

32. A method of stimulating an immune response a patient suffering from a 
liver disorder or an epithelial cancer to a polypeptide according to the se- 
quence of SEQ ID No. 1 to SEQ ID No. 9 or a functional variant thereof, 
wherein at least one component selected from the group consisting of a po- 
lypeptide according to the sequence of SEQ ID No. 1 to SEQ ID No. 9, a 
functional variant thereof, a nucleic acid encoding the polypeptide, a nucleic 
acid coding for the SEQ ID No. 19, a variant of one of the aforementioned 
nucleic acids, a vector containing one of the aforementioned nucleic acids, a 
cell containing one of the aforementioned nucleic acids, and a cell contai- 
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ning the vector, is administered to the patient in need of such treatment in an 
amount effective to stimulate the immune response in the patient. 

A method for identifying at least one nucleic acid according to SEQ ID No. 
10 to SEQ ID No. 19 or a variant thereof differentially expressed in a 
sample isolated from a patient relative to a reference library or a reference 
sample comprising the following steps: 

1. Detecting the expression of at least one nucleic acid according to 
SEQ ID No. 10 to SEQ ID No. 19 or a variant thereof in a sample 
isolated from a patient, 

2. Comparing the expression of at least one nucleic acid detected in 
step 1 with the expression of the same at least one nucleic acid in a 
reference library or in a reference sample, 

3. Identifying this at least one nucleic acid which is differentially ex- 
pressed in the sample isolated from the patient compared to the refe- 
rence library or the reference sample. 

A method of diagnosing a liver disorder or an epithelial cancer comprising 
the following steps: 

1. Detecting the expression of at least one nucleic acid according to 
SEQ ID No. 10 to SEQ ID No. 19 or a variant thereof in a sample 
isolated from a patient, 

2. Comparing the expression of at least one nucleic acid detected in 
step 1 with the expression of the same at least one nucleic acid in a 
reference library or in a reference sample, 

3. Identifying this at least one nucleic acid which is differentially ex- 
pressed in the sample isolated from the patient compared to the refe- 
rence library or the reference sample, and 

4. Matching the at least one nucleic acid identified in step 3 with at 
least one nucleic acid differentially expressed in a pathologic refe- 
rence sample or pathologic reference library, 
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wherein the matched nucleic acid(s) are indicative of the patient suffering 
from a liver disorder, or an epithelial cancer. 

35. A method according to the embodiment 34, wherein at least 2, at least 3, at 
least 4 at least 5, at least 6, or at least 7 nucleic acids are identified. 

36. A method according to the embodiment 34, wherein the at least one nucleic 
acid is detected by PCR based detection or by a hybridization assay. 

37. A method according to the embodiment 34, wherein the expression of the at 
least one nucleic acid is compared by a method selected from the group con- 
sisting of solid-phase based screening methods, hybridization, subtractive 
hybridization, differential display, and RNase protection assay. 

38. A method according to the embodiment 34, wherein the sample isolated 
from the patient is selected from the group consisting of liver tissue, a liver 
cell, tissue from another organ subject to cancerous transformation, a cell 
from this organ, blood, serum, plasma, ascitic fluid, pleural effusion, cere- 
bral spinal fluid, saliva, urine, semen, and feces. 

39. A method according to the embodiment 34, wherein the reference sample is 
isolated from a source selected from a non-diseased sample of the same pa- 
tient, and a non-diseased sample from another subject. 

40. A method according to the embodiment 34, wherein the reference sample is 
selected from the group consisting of liver tissue, a liver cell, blood, serum, 
plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, 
semen, and feces. 

41. A method according to the embodiment 34, wherein the reference library is 
an expression library or a data base containing clones or data on liver disor- 
der-specific expression of the at least one nucleic acid. 
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42. A method according to one of the embodiments 34, wherein the pathologic 
reference sample is isolated from a source selected from a diseased sample 
from another patient suffering from a liver disorder, and epithelial cancer 

43. A method according to one of the embodiments 34, wherein the pathologic 
reference library is a data base containing data on differential expression of 
the at least one nucleic acid according to the invention in samples isolated 
from another patient, suffering from a liver disorder or epithelial cancer re- 
lative to control expression in a reference sample or reference library. 

44. A method according to one of the embodiment 34, wherein the liver disor- 
ders, is a disorder selected from the group consisting of hepatocellular 
carcinoma, benign liver neoplasms, and cirrhosis. 

45. A method according to one of the embodiments 34, wherein the epithelial 
cancer is an adenocarcinoma of an organ selected from the group consisting 
of the lung, the stomach, the kidney, the colon, the prostate, the skin and the 
breast 

46. A method for identifying at least one polypeptide according to SEQ ID No. 
1 to SEQ ID No. 9 or a functional variant thereof differentially expressed in 
a sample isolated from a patient relative to a reference library or a reference 
sample comprising the following steps: 

1. Detecting the expression of at least one polypeptide according to 
SEQ ID No. 1 to SEQ ID No. 9 or a functional variant thereof in a 
sample isolated from a patient, 

2. Comparing the expression of at least one polypeptide detected in 
step 1 with the expression of the same at least one polypeptide in a 
reference library or in a reference sample, 

3. Identifying this at least one polypeptide which is differentially ex- 
pressed in the sample isolated from the patient compared to the refe- 
rence library or the reference sample. 



t 



-86- 

47. A method of diagnosing a liver disorder or epithelial cancers comprising the 
following steps: * ....... 

1. Detecting the expression of at least one polypeptide according to 
SEQ ID No. 1 to SEQ ID No. 9 or functional variants thereof in a 
sample isolated from a patient, 

2. Comparing the expression of at least one polypeptide detected in 
step 1 with the expression of the same at least one polypeptide in a 
reference library or in a reference sample, 

3. Identifying this at least one polypeptide which is. differentially ex- 
pressed in the sample isolated from the patient compared to the refe- 
rence library or the reference sample, and 

4. Matching the at least one polypeptide identified in step 3 with at 
least one polypeptide differentially expressed in a pathologic refe- 
rence sample or pathologic reference library, 

wherein the matched polypeptide(s) are indicative of the patient suffering 
from a liver disorder, or an epithelial cancer. 

48. A method according to the embodiment 47, wherein at least 2, at least 3, at 
least 4, at least 5, at least 6, or at least 7 polypeptides are identified. 

49. A method according to the embodiment 47, wherein the polypeptides are 
detected by a method selected from the group consisting of gel electropho- 
resis, chromatographic techniques, immunoblot analysis, immunohistoche- 
mistry, enzyme based immunoassay, surface plasmon resonance, HPLC, 
mass spectroscopy, immunohistochemistry, and enzyme based immunoas- 
say. 

50. A method according to the embodiment 47, wherein the polypeptides are 
compared by a method selected from the group consisting of two dimensio- 
nal gel electrophoresis, chromatographic separation techniques, immunoblot 
analysis, surface plasmon resonance, immimohistochemistry, and enzyme 
based immunoassay. 
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51. A method according to the embodiment 47, wherein the sample isolated 
from a patient is selected from the group consisting of liver tissue, a liver 
cell, tissue from another organ subject to cancerous transformation, a cell 
from this organ, blood, serum, plasma, ascitic fluid, pleural effusion, cere- 
bral spinal fluid, saliva, mine, semen, and feces. 

52. A method according to the embodiment 47, wherein the reference sample is 
isolated is from a source selected from a non-diseased sample of the same 
patient, and a non-diseased sample from another subject. 

53. A method according to the embodiment 47 wherein the reference sample is 
selected from the group consisting of liver tissue, a liver cell, blood, serum, 
plasma, ascitic fluid, pleural effusion, cerebral spinal fluid, saliva, urine, 
semen, and feces. 

54. A method according to the embodiment 47, wherein the reference library is 
an expression library or a data base containing clones or data on liver disor- 
der-specific expression of the at least one polypeptide. 

55. A method according to one of the embodiments 47, wherein the pathologic 
reference sample is isolated from a source selected from a diseased sample 
from another patient suffering from a liver disorder, and epithelial cancer. 

56. A method according to one of the embodiment 47, wherein the pathologic 
reference library is a data base containing data on differential expression of 
the at least one polypeptide according to the invention in samples isolated 
from another patient, suffering from a liver disorder or epithelial cancer re- 
lative to control expression in a reference sample or reference library. 

57. A method according to one of the embodiment 47, wherein the liver disor- 
ders is a disorder selected from the group consisting of hepatocellular 
carcinoma, benign liver neoplasms, and cirrhosis, 
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58. A method according to the embodiment 47, wherein the epithelial cancer is 
an adenocarcinoma of an organ selected from the group consisting of the 
lung, the stomach, the kidney, the colon, the prostate, the skin, and the 
breast, 

59. A method of preventing a patient from developing a liver disorder or an 
epithelial cancer, wherein at least one component selected from the group 
consisting of a polypeptide according to the sequence of SEQ ID No. 1 to 
SEQ ID No. 9, a functional variant thereof, a nucleic acid encoding the po- 
lypeptide, a nucleic acid coding for the SEQ ID No. 19, a variant of one of 
the aforementioned nucleic acids, a nucleic acid having a sequence com- 
plementary to one of the aforementioned nucleic acids, a nucleic acid which 
is a non-functional mutant variant of one of the aforementioned nucleic 
acids, a vector containing one of the aforementioned nucleic acids, a cell 
containing one of the aforementioned nucleic acids, and a cell containing the 
vector, is administered to the patient in need of such preventive treatment in 
a therapeutically effective amount. 

60. A method of identifying at least one pharmacologically active compound 
comprising the following steps: 

1. providing at least one polypeptide according to the SEQ ID No. 2 or 
a functional variant thereof 

2. contacting the at least one polypeptide, with at least one putative 
pharmacologically active compound, 

3. assaying the interaction of the at least one putative pharmacologi- 
cally active compound and the at least one polypeptide, 

4. identifying the at least one pharmacologically active compound 
among the assayed at least one putative pharmacologically active 
compound by selecting those assayed at least one putative pharma- 
cologically active compound which directly or indirectly functio- 
nally interact with the at least one polypeptide. 
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61 . The method according to the embodiment 60, wherein the at least one poly- 
peptide is provided in a form selected from the group of the at least one po- 
lypeptide is attached to a column, the at least one polypeptide is attached to 
an array, the at least one polypeptide is is contained in an electrophoresis 
gel, the at least one polypeptide is attached to a membrane, the at least one 
polypeptide is attached to a column, and the at least one polypeptide is ex- 
pressed by a cell. 

62. The method according to the embodiment 60, wherein the interaction is 
assayed by a method selected from the group of enzyme and fluorescence 
based cellular reporter assays in which interaction of the at least one phar- 
macological substance with a recombinant fusion protein including the at 
least one polypeptide is detected. The interaction may preferably also be as- 
sayed by surface plasmon resonance, HPLC and mass spectroscopy. 

63 . The method according to the embodiment 60, wherein the direct or indirect 
functional interaction is selected from the group consisting of induction of 
the expression of the at least one polypeptide, inhibition of the expression 
of the at least one polypeptide, activation of the function of the the at least 
one polypeptide, inhibition of the function of the the at least one polypepti- 
de. 

Examples 

Example 1 ■ Identification of nucleic acids deregulated in liver disorders 

A unique approach employed in this invention utilized discrete, pathologist- 
confirmed liver cancer pathologies for production of disease specific cDNA libra- 
ries enriched in genes specifically up- and down-regulated in HCC compared with 
a pool of non-neoplastic human livers. The library is a genome-wide representation 
of deregulated gene expression in HCC and therefore includes all potential HCC 
deregulated genes. Repetitive hybridization to these library clones with labeled 
expressed nucleic acids from many additional discrete, pathologist-confirmed liver 
cancer samples (HCCs) and non-malignant liver lesions indicated nucleic acids 
highly deregulated in HCC. The surprising finding is that this approach provided 
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deregulated nucleic acids that had not previously been identified as well as many 
deregulated nucleic acids that were not before associated with HCC, the elevated 
expression of which can also be associated with other neoplasms. These HCC de- 
regulated genes are the subject of this invention. 

The screening and verification strategy is already inventive per se owing to the 
elaborate and defined choice of parameters. Identification of differentially ex- 
pressed genes according to the invention, relied upon histopathologically distin- 
guished liver disease tissue for comparison of gene expression changes in disorders 
of the human liver. Non-diseased reference liver samples for the experiments was 
also diagnostically confirmed and pooled from 3 independent samples to reduce 
detection of false positives resulting from inter-individual variations. 

Example 2. Preparation of HCC subtracted cDNA libraries. 

RNA was isolated from three pathologist-confirmed HCC tumor samples and from 
three pathologist-confirmed non-diseased human liver samples using the TRIZOL 
reagent (LifeTechnologies) according to standard methods (Chomczynski & Sac- 
chi, 1987, Anal. Biochem. 162:156-159). The tissues used for the generation of 
cDNA libraries was from patients that provided specific informed consent for uti- 
lization of this material for research purposes, including commercial research. 
mRNA was converted to double stranded cDNA with reverse transcriptase and 
DNA polymerase as described in the instructions provided in the "PCR select 
cDNA subtraction kit" from Clontech Laboratories. To enrich for cDNAs specifi- 
cally increased and decreased in HCC, cDNAs expressed in common and at similar 
levels in the reference liver pool and in HCC were removed by subtractive suppres- 
sive hybridization (SSH) according to the instructions provided in this kit and as 
described by Diatchenko et al. (1996, Proc. Natl. Acad. Sci. USA 93:6025-6030). 
The SSH steps were performed in both directions (subtracting non-diseased liver 
cDNAs from HCC cDNAs and subtracting HCC cDNAs from non-diseased liver 
cDNAs) so the resulting cDNA molecules represent nucleic acid sequences both 
up- and down-regulated in HCC but do not represent those that are not differen- 
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tially expressed. In addition a normalized but not subtracted HCC cDNA library 
was generated to better represent rare mRNA transcripts in HCC tissues. These 
cDNAs were separately cloned into the pCRII vector (Invitrogen) by ligation into 
this plasmid followed by. etectrophoretic transformation into E. coli XL- 1 -Blue 
electroporation-competent cells (Stratagene). The cloning was carried out as de- 
scribed by the supplier of the vector and competent cells. Cloned differentially 
expressed cDNAs were plated onto selective (ampicillin) media to isolate individu- 
al clones. 960 clones were isolated from each SSH library and 576 clones isolated 
from the normalized HCC library and cultures established in 96-well microtiter 
plates. Together these 2500 cDNA clones provide a unique representation of 
mRNA expression specific for human HCC tissue. 

Example 3. Preparation and hybridization of HCC cDNA microarravs. 

1 ml cultures of the SSH cDNA library clones described above were established 
and the cDNA inserts amplified by PCR with primers specific to the vector se- 
quence flanking the cDNA inserts. The M13 forward (5- 
GTAAAACGACGGCCAG-3';SEQ ID NO. 20) and Ml 3 reverse primers (5'- 
CAGGAAACAGCTATGAC-3'; SEQ ID NO. 21) were employed for the PCR 
amplification of clone inserts. Fifty microliters of the bacterial cultures were heat 
denatured at 95°C for 10 minutes, debris removed by centrifugation, and 2 j*l of the 
supernatant included in a standard PCR [IX Amplitaq PCR buffer, 2.5 mM MgCfe, 
37.5 nM each primer, 0.5 mM each of dATP, dCTP, dGTP and dTTP and 1.5 units 
Amplitaq DNA polymerase (Applied Biosystems)]. Reaction conditions were 95°C 
for 5 minutes followed by 35 cycles of: 94°C for 30 seconds, 60°C for 30 seconds, 
72°C for 60 seconds; then followed by 72°C for 7 minutes and then cooled to 4°C. 
Amplification of cDNA inserts was confirmed by electrophoresis of a 5% of the 
PCR on a 1% agarose gel containing 0.4 jig/ml ethidium bromide and run in IX 
Tris Acetate EDTA (TAE; 40mM Tris-acetate, ImM EDTA, pH 7.5) buffer . Each 
of the SSH clone amplified insert sequences were affixed to sialinized glass 
microscope slides (GAPS Corning) using a GeneticMicrosystems (now Affyme- 
trix) 417 cDNA arrayer robot to generate custom HCC cDNA microarrays. The 
protocol for spotting the cDNA inserts to the slides was according to that published 
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by Hedge et al. (2000, Biotechniques 29:548-560) except that PCR products were 
spotted directly from the PCR microtiter plates without purification or adjustment 
of the cDNA buffer. In addition to the SSH cDNA clone inserts, numerous control 
DNAs were spotted onto the microarrays as controls for hybridization reactions. 
Further, approximately 2000 publicly available cDNA clones corresponding to 
genes previously reported to be involved in cancer, especially liver cancer, were 
purchased from the German Genome Research Center (RZPD), expanded, ampli- 
fied and spotted onto these microarrays as described above. 

For preparation of hybridization probes, 20 micrograms of RNA from additional 
.pathology-confirmed liver disorders and from the same quantity of pooled non- 
diseased liver RNy^V was converted to cy5-fluorescence-labeled and cy3- 
fluorescence-labeled cDNA, , respectively (cy5-CTP and cy3-CTP, Pharmacia) 
using reverse transcriptase according to the standard methods (Hedge et al., 2000, 
Biotechniques 29: 548-560). Using this protocol, these labeled cDNAs were com- 
petitively hybridized to the HCC microarrays. Following prehybridization at 42°C 
for 45 minutes in 5X SCC (0.75 M sodium citrate, 75 mM sodium citrate, pH 7.0); 
0.1% SDS (sodium dodecyl sulfate) and 1% BSA (bovine serum albumin), the 
hybridization was carried out overnight at 42°C in buffer containing 50% formami- 
de, 5XSSC, and 0.1% SDS. Hybridized slides were washed in stringent conditions 
(twice at 42°C in IX SSC, 0.1% SDS for 2 minutes each; twice at room tempera- 
ture in 0.1X SSC, 0.1% SDS for 4 minutes each; and twice at room temperature in 
0.05X SSC for 2 minutes each), dried and analyzed with the GeneticMicrosystems 
(Affymetrix) 418 cDNA microarray scanner and associated Imagene 4.1 image 
analysis software according to the manufacture's recommendations. 

Example 4. Identification of deregulated, disease-relevant nucleic acids. 

Deregulated cDNAs according to the invention include those that were expressed 
by at least 2-fold higher levels in more than 50% of the 21 HCC samples subjected 
to the cDNA microarray expression profiling (Figure 21). These cDNAs inserts 
were sequenced and compared with the non-redundant and EST databases of Gen- 
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Bank using the BLAST algorithm (Altschul et al., 1997, Nucleic Acids Res., 
25:3389-3402). The nucleic acids according to the invention were selected from the 
deregulated nucleic acids that fulfilled this criteria and the sequences that pre- 
ferably have not previously been associated with liver disorders, especially with 
HCC. Expression values do not represent a normal distribution. However, evaluati- 
on of the values with nonparametric measures demonstrates significant differences 
in gene expression for the nucleic acids according to the invention (Figures 22-24). 
Moreover, comparison of data between liver disorders indicates differences in ex- 
pression values in proliferative hyperplastic and malignant neoplastic liver disorder 
samples (Figure 24). Verification of deregulated expression in HCC of the nucleic 
acids according the invention was accomplished with independent methods as de- 
scribed in detail below. 

Extensive patient information has also been assembled for each patient profiled. 
This information includes parameters such as age, sex, diagnosis, tumor grading 
scale parameters, tumor size, tissue ischemia time, alcohol use status, and follow 
up information (Figure 25 includes some of the diagnostic information available 
for patient samples employed for RT-PCR validation of deregulated nucleic acid 
expression in HCC). 

In addition to providing a pool of candidate cDNAs highly enriched for differen- 
tially expressed genes, the SSH library represents all differentially expressed genes 
in HCC, including those not expressed in non-diseased liver. Consequently the 
SSH libraries include sequences that would not appear in conventional cDNA 
libraries or in genome-wide cDNA microarrays from public domain cDNA clone 
collections. Together these parameters better illustrate nucleic acids specifically 
deregulated in disorders according to the invention. 

The regulation of the polypeptides and nucleic acids according to the invention is 
preferably essential for the pathologic process and which are thus in a direct or 
indirect relationship with diagnosis, prevention and/or treatment of liver disorders. 
The polypeptides of these genes and the nucleic acids encoding these do not belong 
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to the targets known until now for diagnosis, prevention and/or the treatment of 
. liver disorders, or other cancers as indicated above, such that surprising and com- 
pletely novel therapeutic approaches result from this invention. 

Example 5: Independent verificati on of differential expression of the nucleic acids 
according to the invention 

RNA was isolated from human patient samples as described in detail above. HCC 
samples for this analysis were not. from the same patients as employed, for produc- 
tion of the HCC SSH library or for cDNA microarray chip hybridization (see ex- 
amples above, Figures 21 - 23). In addition to HCC samples, RNA was prepared 
from independent non-diseased liver samples to assess expression of the nucleic 
acids according to the invention in non-diseased liver tissue. Further, RNA was 
prepared from additional non-diseased and cancer tissues to assess expression of 
the nucleic acids according to the invention in other normal human tissues and 
other human cancers. 1 ug of RNA was converted to single-strand cDNA with the 
aid of Superscript reverse transcriptase (LifeTechnologies) in dATP, dCTP, dGTP, 
and dTTP (0.4 mM each), 7.5 nM random 6-nucleotide primer (hexamers), 10 mM 
dithiothreitol and 1 unit RNAse inhibitor using standard procedures known in the 
art (Sambrook et al., Molecular Cloning, 2 nd ed., 1989, Cold Spring Harbor Press, 
NY, USA, pp. 5.52-5.55). The presence or absence of the nucleic acids according 
to the invention was then determined by amplification of these sequences from the 
cDNA with primer pairs specific to each nucleic acid according to the invention in 
PCR experiments. The primers used for this analysis are given in the following 
table: 



Clone 


SEQIDNo. 


Primer! (SEQIDNO.) 


Primer 2 
(SEQIDNO.) 


OBcll 


10 


5»- 

CAGGTGAATTTCAAAGG 
AGGATTTACTCAC- 3' (22) 


5'-GTGAGTAAATCCTCCTT 
TGAAATTCACCTG-3' (23) 


OBcIS 


11 


5'- 


5-TGCCAGGAAACTTCTTG 
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GCAAGCCAGOAAGAGT 
CGTCACG-3' (24) 


CTTGATGC-3' (25) 


IK2 


12 


5'- 

AGTAACCAGTTGAGATG 
AAGCACGTC-3' (26) 


5-CAGAAGAGCAACAAGA 
ATGGTATCCTGC-3 ' (27) 


IK5 


13 


5'- 

AACTTGAGTTCTATTTAC 
CTTGCAC-3' (28) 


5-TTGCTTGGGTCATCTAA 
AGAC-3' (29) 


DAP3 


14 


5'- 

ACTCACGTGCAAGGATG 
ATG-3' (30) 


5-AGCTCTCGGACTCTCAA 
CTG-3'(31) 


LOC5 


15 


5'- 

CTTCTCCTATGACTGATC 
CTACTATG-3' (32) 


5-CAGGATGCAGAACTCAC 
CCTG-3' (33) 


SEC14L 
2 


16 


5'- 

GCAGATTTCCCGTGGCT 
CCTC-3' (34) 


5-GTTGGGCAGCACCTCTG 
TCATC-3' (35) 


SSP29 


17 


5'- 

CTGTGACATTCCGCCTTC 
CTTC-3' (36) 


5'-CCACGCTACTGCAAGAA 
TCTTAC-3' (37) 


HS16 


18 


5'- 

AGAAGTTCAACCTGGAG 
AGATGG-3' (38) 


5-CAAGGAAGCTAGGAATG 
ACAGGAG-3* (39) 


IK3 


19 


5'- 

GCAAAGCCAAATTCATG 
TTACTCT-3' (40) 


5'-CAGATACGAACAGTGAA 
TGGAAATACG-3' (41) 



These primers are also empoyable for diagnosis of disorders according to the in- 
vention, but the skilled worker may as well design other primers specific for a gi- 
ven nucleic acid according to the invention. The PCR included 0.5% of the cDNA, 
IX Amplitaq PCR buffer, 2.5 mM MgCfe, 37.5 nM each primer, 0.5 mM each of 
dATP, dCTP, dGTP and dTTP and 1.5 units Amplitaq DNA polymerase (Applied 
Biosystems). PCR conditions were optimized as needed for each primer pair, typi- 
cally: 94°C for 3 minutes followed by 30 cycles of: 94°C for 15 seconds, 60°C for 
30 seconds, 72°C for 60 seconds, then cooled to 4°C. Amplification of cDNA 
inserts was confirmed by electrophoresis of a 5-10% of the PCR on a 1% agarose 
gel containing 0.5 |Ag/ml ethidium bromide and run in IX Tris Acetate EDTA 
(TAE) buffer. Standard controls for RT-PCR including RNAse treatment of 
samples prior to cDNA synthesis and omission of reverse transcriptase routinely 
demonstrated the specificity of these reactions. Reactions were scored for expressi- 
on (+) or absence of expression (-) based upon whether a discrete band of the cor- 
rect molecular size was observed in the gel. Very faint or ambiguous bands under 
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these conditions were scored with (+/-). A summary of these verification studies in 
HCC and non-diseased liver is given in Figure 25. Data representative of these 
analyses in independent HCC and non-diseased liver samples is provided in Figure 
26. 

An additional independent validation of differential expression of the nucleic acids 
according to the invention is illustrated in Figure 27. In this case, 15 ng of RNA 
from two HCC samples and frpm non-diseased liver was subjected to denaturing 
electrophoretic separation on a 1% agarose gel containing 2.2 M formaldehyde and 
IX MOPS buffer (10 mM 4-morpholinepropanesulfonic acid, 1 mM EDTA, 5 mM 
sodium acetate, pH 7.0) run in IX MOPS buffer. The size-fractionated denatured 
RNA was transferred to nylon membrane (GeneScreen, New England Nuclear) 
with the RNA (northern) blot technique and cross linked to the membrane with UV 
light, all according to procedures well known to the skilled artisan (Sambrook et 
al., Molecular Cloning, 2 nd ed., 1989, Cold Spring Harbor Press, NY, USA, pp 
7.39-7.52). cDNA clone inserts of from the SSH clones for OBcIl and OBcl5 
(SEQ ID Nos. 10 and 1 1) were isolated by PCR amplification as described in the 
previous example. Single stranded radiolabeled RNA probes corresponding to the- 
se sequences were synthesized from this template using SP6 and T7 RNA polyme- 
rase in the presence of a- 32 P-UTP in IX labeling buffer: 0.5 mM ATP, CTP, GTP, 
10 mM dithiotreitol, and 20 units of appropriate RNA polymerase at 37°C for 35 
minutes. The resulting antisense probe is complementary to the corresponding 
mRNA sequence and thus expected to hybridize specifically to the mRNA se- 
quence on the northern blot. Conversely, the sense probe sequence matches that of 
the mRNA and thereby is expected not to hybridize to the mRNA. Identical nort- 
hern blots were prehybridized in 15 ml of 250 mM monobasic sodium phosphate, 
250 mM dibasic sodium phosphate, 7% SDS, 1 mM EDTA and 1% BSA for at 
least 30 minutes at 68°C. For hybridization the prehybridization buffer was remo- 
ved and replaced with 10 ml of the same buffer including the sense and each anti- 
sense RNA probes described above at 68°C overnight as described by the supplier 
of the buffer. The RNA blots were washed under stringent conditions (2X SSC, 
0.1% SDS twice at room temperature for 15 minutes each; IX SSC, 0.1% SDS 
twice at 68°C for 10 minutes each), dried and exposed to x-ray film to produce an 
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autoradiography As seen in Figure 27, each antisense probe specifically hybridizes 
to discrete HCC RNA but only weakly or not at all to non-diseased liver RNA The 
specificity of these results is demonstrated by the absence of specific signal from 
the corresponding sense probe for both OBcll and OBclS. In addition, RNAs of 
different molecular weights are apparent with the OBcll antisense probe. This re- 
sult most likely represents discrete mRNA species, perhaps produced by alternative 
splicing. These species were expected based upon the finding that several different 
sized cDNA clones corresponding to this sequence are reported in the GenBank 
sequence database. 

These data provide independent verification of deregulated expression of the 
nucleic acids according to the invention in HCC. Expression of the nucleic acids 
according to the invention is either absent or observed only at very low levels in 
non-diseased liver, thereby validating the differential expression of these nucleic 
acids identified by hybridization to the cDNA microarray. The results provide sur- 
prising evidence that the nucleic acids according to the invention can be used to 
diagnose, prevent and/or treat disorders according to the invention. 

Example 6: A method of diagnosing using HCC specific probes. 

A diagnostic method for disorders according to the invention preferably based on 
the polymerase chain reaction (PCR) can be established. A standard PCR detection 
of nucleic acid sequences of the invention can be sufficient to identify, for examp- 
le, circulating HCC tumor cells in the blood stream of the patient. Detection of 
expression of nucleic acid sequences of the invention in tumor biopsy material 
however, such as from a fine needle biopsy, would also be a preferred indication 
for this diagnostic procedure. Nucleic acid sequences of the invention, OBcl5 (SEQ 
ID No, 1 1) for example, are not detected in most non-diseased tissues and relati- 
vely specifically expressed in e.g. HCC. Elevated expression of this nucleic acid in 
cirrhosis and HCC is also demonstrated indicating the potential discriminatory 
power of such an approach for differential diagnosis of liver diseases (Figure 24). 
The PCR diagnostic would preferably require approximately 1 pg, preferably at 
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least 100 ng, more preferably at least 1 jig of RNA isolated from patient material. 
In the preferred utilization the RNA would be isolated according to standard proce- 
dures from e.g. the white blood cell fraction preferably from circulating blood ob- 
tained by the minimally invasive venupuncture procedure. In this preferred case, 
the procedure would detect the presence of HCC tumor cells in the blood circulato- 
ry system. RNA could similarly be isolated from liver biopsy material. For specific 
detection of OBclS, for example, the PCR diagnostic would include several primers 
specific for OBcl5 nucleic acid sequence, including a specific antisense primer 
(Primer OBcl5-pl; 5'- GCCACAGGTTGAACACTTAATTTG-3 , ; SEQ ID NO. 
42; from nucleotide .350-327 on SEQ ID No. 11) fpr.cDNA synthesis from the 
RNA generated from the patient sample. Similarly specific PCR primers such as 
for example OBcl5-p2 (5 ' -AGGAAGAGTCGTC ACGAGAACC-3 SEQ ID NO. 
43; from nucleotide 107-128 on SEQ ID No.ll) and OBcl5-p3 (5'- 
ATAATGCTGTGCTTAGTTTATTGCC-3 , ; SEQ ID NO. 44; from nucleotide 
313-289 on SEQ ID No.ll). Sensitivity, specificity and quality control may be 
improved by the provision of an additional primer set (for example: OBcl5-p4; 5- 
GATCGTGGACATTTCAACCTC-3 1 ; SEQ ID NO. 45; from nucleotide 147-167 
on SEQ ID No.ll and OBcl5-p5; 5'-TCTTGCTTGATGCTTTGGTC-3 , ; SEQ ID 
NO. 46; from nucleotide 280-261 on SEQ ID No.l 1) that are specific for the OBclS 
nucleic acid insert and internal (nested) to primers OBcl5-p2 and -p3. 

cDNA may be prepared from the patient RNA sample following digestion of the 
RNA with RNAse-free DNAse-l (Roche) to eliminate potential contamination by 
genomic DNA. This contamination possibility is further controlled by including 
primers for PCR amplification from sequences of different exons of the OBclS 
gene such that PCR products resulting from a genomic DNA template (and thereby 
not reflective of expression of the mRNA corresponding to OBcl5) would be larger 
than the RNA specific PCR products. cDNA synthesis can e.g. be primed by the 
OBclS specific OBclS-pl (at about 1 \xM) with the aid of reverse transcriptase 
[such as Maloney murine leukemia virus reverse transcriptase (Roche) at about 2 
unit/reaction] in an appropriate buffer such as 50 mM Tris-HCI, 6 mM MgC12, 40 
mM KC1, and 10 mM dithiotreitol, pH 8.5. Also required in the cDNA synthesis 
reaction is dATP, dCTP, dGTP and dTTP, each at about 1 mM, RNAse inhibitor, 
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such as placental KNAse inhibitor (Roche) at about 1-10 units/reaction, cDNA 
synthesis would be preferably carried out at 42°C for 30 to 60 minutes followed by 
heating at 95°C for 10 minutes to denature the RNA template. The resulting cDNA 
can be employed as the template for a PCR to detect OBclS in the blood (or liver 
biopsy sample). The additional reagents required for PCR detection of OBclS 
would preferably also be provided including: 10X Taq DNA polymerase buffer 
(500 mM Tris-Cl pH 8.3, 25 mM MgCI 2 , 0.1% Triton X-100); a mixture of dATP; 
dCTP, dGTP and dTTP for a final concentration of 0.2 mM each; Taq DNA poly- 
merase (2.5U/reaction), and OBclS specific primers such as OBcl5-p2, OBcl5-p3, 
OBcl5~p4, and OBcl5-p5 (0.1-1 \xM final concentration). A positive control for 
PCR amplification such DNA from a plasmid clone with the OBclS sequence insert 
would preferably also be included (1-10 ng/reaction). The PCR can e.g. be carried 
out over 22-40 cycles of 95°C for 30 seconds, 60°C for 30 seconds, 72°C for 60 
seconds. As indicated above, preferred additional sensitivity and specificity may be 
achieved in this diagnostic procedure by utilization of the additional OBclS primer 
set located within the sequence amplified with the original PCR primer set. In this 
case a subsequent PCR under conditions similar to those utilized in the first PCR 
reaction except that preferably primers OBcl5-p4 and OBcl5-p5 would be em- 
ployed to amplify the nested sequence in a reaction that included 1-10 \il of the 
first PCR as the template DNA. Alternatively, the reaction may preferably be car- 
ried with the first primer set (OBcl5-p2 and OBcl5-p3) for 10-15 cycles after which 
and 1-10 fil of this reaction then included as template in a new PCR reaction with 
primers OBcl5-p4 and OBcl5-p5 (and including all the necessary PCR compo- 
nents). Detection of OBclS specific PCR produces) should preferably utilize aga- 
rose gel electrophoresis as is known in the art and described in previous examples. 
Included in the diagnostic should preferably be a comparable fluid or tissue extract 
as a control for such PCR-based diagnostic test This may include serum or plasma 
from non-diseased individuals and/or serum, plasma or tissue extracts from an ap- 
propriate animal model. If the PCR-determined expression of the nucleic acid ac- 
cording to the invention such as the product of the reaction with primers OBclS-p4 
and OBcl5-p5 is upregulated in the sample isolated from the patient relative to the 
control and if in particular the upregulated expression essentially matches the di- 
sorder specific (mean) expression ratios such as those illustrated in figure 21 then 
such matching is indicative of the patient suffering from the disorder. 
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Variations on this approach can also be appreciated. The cDNA synthesis and PCR 
amplifications can be carried our sequentially or simultaneously in a single reaction 
vessel utilizing heat stabile DNA polymerases with reverse transcriptase activities, 
such as provided by the Titan one-tube or Carboxydothermus DNA polymerase 
one-set RT-PCR systems from Roche. Alternatively the PCR product can be mo- 
nitored by incorporation of fluorescently-labeled primers or various fluorescence- 
based indicators of PCR product including the Taqman probe hydrolysis systems 
and fluorescent double-stranded DNA intercalating molecules such as SYBR 
green. The fluorescent-based approaches provide advantage as the accumulation of 
PCR product can be continuously monitored to achieve sensitive quantitative as- 
sessment of expression of the nucleic acid according to the invention. This should 
be particularly advantageous for nucleic acids increased in blood or tissues of di- 
sorders according to the invention but also present at lower levels in non-diseased 
patients and tissues such that quantitative information about the level of expression 
of the nucleic acid is acquired. Further, as with this example, accurate quantitation 
of nucleic acid expression levels contributes to differential diagnosis, between cir- 
rhosis and HCC for example. Comparison of this data with supplied standards indi- 
cative of disease and absense of disease provides an important advantage for such a 
diagnostic procedure. 

Additional variations on this diagnostic strategy include simultaneous detection of 
multiple nucleic acids according to the invention and/or of nucleic acids according 
to the invention together with other nucleic acids implicated in the disorder. 
Further hybridization-based diagnostic detection of nucleic acids according to the 
invention is also envisioned. In this case mRNA detection preferably utilizing RNA 
blot, RNAse protection or in situ hybridization on patient cells or tissue biopsy 
samples is also effective. 

By similar methods and variants thereof the nucleic acids according to the inventi- 
on and/or of nucleic acids according to the invention together with other nucleic 
acids can be utilized for diagnosis of the disorders according to the invention. 
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Example 7: A method of diapnosins via antibody detection of polypeptides accor- 
ding to the invention 

A preferred diagnostic method for disorders according to the invention is based on 
antibodies directed against a polypeptide according to the invention. For example, 
a diagnostic procedure may preferably employ serum detection of specific upregu- 
lated gene proteins via enzyme-linked immunosorbent assay (ELISA) assay. In a 
simple form the diagnostic assay preferably includes an microliter plate or strip of 
microtiter wells, e.g., thoroughly coated with an isolated and purified antibody 
specific to a polypeptide according to the invention such as OBcl5.pr (SEQ ID No. 
2). The antibody may for example be an affinity purified polyclonal antibody, such 
as is commonly raised in rabbits, for example, or a purified monoclonal antibody 
such as is commonly produced in mice according to procedures well established in 
the art (Cooper, H.M. & Paterson, Y., (2000), In Current Protocols in Molecular 
Biology (Ansubel, F.A. et aL, eds.) pp. 11.12.1 - 11.12.9, Greene Publ. & Wiley 
Intersci., NY); (Fuller S.A. et aL, (1992), In Current Protocols in Molecular Biolo- 
gy (Ansubel, F.A. et at., eds.) pp. 11.4.1 - 11.9.3, Greene Publ. & Wiley Intersci., 
NY). Preferably, the antibody may a recombinant antibody obtained from phage 
display library panning and purification as has been described by Knappik et al. 
(2000, J. Molec. Biol. 296:57-86) or by Chadd and Chamow (2001 Curr. Opin. 
Biotechnol. 12:188-94) or a fragment thereof. The antibody coating is preferably 
achieved by dilution of the anti-OBcl5.pr antibody to 1-100 ug/ml in a standard 
coating solution such as phosphate buffered saline (PBS). The antibody is pre- 
ferably bound to the absorptive surface of the microtiter well (such as a Nunc Ma- 
xisorp immunoplate) for 60 minutes at 37°C, or overnight at room temperature or 
4°C. Prior to binding sample to the coated wells, the wells are preferably thoroug- 
hly blocked from non-specific binding by incubation for 15-60 minutes at room 
temperature in a concentrated protein solution such as 5% bovine serum albumin in 
phosphate buffered saline or 5% non-fat dry milk powder resuspended in the same 
buffer. Preferably, the patient sample material is then applied to the microtiter 
wells, diluted into the blocking solution to increase specificity of detection. The 
sample may be for example plasma or serum or protein extract from tissue biopsy 
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or surgical resection prepared according to methods well known in the art (Smith, 
J.A. (2001) In, Cunrent Protocols in Molecular Biology, Ausubel, F.A. et aL, eds) 
pp. 10.0.1- 10.0.23, Greene Publ. & Wiley Intersci., NY). In particular, the patient 
sample is brought into contact with the antibody-coated well for 30-120 minutes 
Tor longer) at room temperature or at 4°C. Non-specifically interacting proteins are 
preferably removed by extensive washing with a standard wash buffer such as 0.1 
M Tris-buffered saline with 0.02-0.1% Tween 20, for example. Washes are pre- 
ferably carried out for 3-10 minutes and repeated 3-5 times. Detection of OBclS.pr 
polypeptide in the patient sample is for example achieved by subsequent binding 
reaction with a second, independent anti-OBclS.pr antibody, generated as described 
above, recognizing a distinct epitope on the OBclS.pr polypeptide in the standard 
two-site 'sandwich* type ELISA, Binding of the second anti-OBcl5.pr antibody is 
for example achieved by incubating the wells in the antibody (at a concentration of 
1-100 ng/ml in blocking solution, for example) at room temperature for 30-60 mi- 
nutes followed by extensive washing as in the previous step. The second antibody 
may preferably be directly coupled to an enzyme capable of producing a colorige- 
nic or fluorgenic reaction product in the presence of an appropriate substrate, such 
as alkaline phosphatase. Alternatively, for example an anti-species and anti-isotype 
specific third antibody, so coupled to an enzyme, is employed to generate a reac- 
tion product that preferably can be detected in a standard spectrophotometric plate 
reader instrument. For the reaction product development, the washed (as above) 
antibody-antigen-enzyme complex is preferably exposed to the colorigenic sub- 
strate, such as AttoPhos from Roche for about 10 minutes at room temperature, the 
reaction may be stopped with a low pH buffer such as 50 mM Tris-HCl pH 5.5, or 
can instead be directly assayed. The amount of specifically bound OBclS.pr poly- 
peptide is for example determined by measurement of the amount of the enzymatic 
reaction product in each well following excitation at the appropriate wavelength in 
the spectrophotometer (420 nm in this case). Measurement is preferably made in 
the plate reader at the emission wavelength (560 nm in this case). Preferably inclu- 
ded in the diagnostic is an OBclS protein standard, such as purified recombinant 
OBclS.pr polypeptide, for example. A dilution series of this protein standard is 
preferably included in parallel in the ELISA as a control for the reactions and to 
deduce a protein standard curve for comparison of polypeptide expression levels as 
is well known in the art A concentration range corresponding indicative of the 
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particular liver disorder(s) should preferably be provided in the diagnostic. In addi- 
tion, a comparable fluid or. tissue extract should preferably also be included as a 
control for such ELISA test. This may preferably include serum or plasma from 
non-diseased individuals and/or serum, plasma or tissue extracts from an appro- 
priate animal modeL Such ELISA detection diagnostics are common in the art (see 
for example, Hauschild et aL, 2001, Cancer Res. 158:169-77). The samplexontrol 
protein levels determined by ELISA are compared with ELISA-determined disor- 
der specific protein expression ratio values preferably determined in pathologist- 
confirmed tissues of patients suffering from a disorder according to the invention in 
relation to control samples. In case the protein level of the samplexontrol essen- 
tially matches the disorder specific protein expression ratio values such matching is 
preferably indicative of the patient suffering from the disorder. Preferably such 
diagnosis is carried out for more than 1 polypeptide according to the invention. 

In addition the diagnostic may be directed to detecting an endogenous antibody 
directed against a polypeptide according to the invention or a functional variant 
thereof or fragment thereof present in the sample isolated from a patient which 
antibody or fragment thereof is directed against a polypeptide according to the in- 
vention. Detection of such autoimmune antibodies may be accomplished by me- 
thods generally known to the skilled artisan, e.g. by immunoaffinity assays such 
the ELISA described in detail above using polypeptides according to the invention 
or functional variants thereof or parts thereof as a probe. The presence of such au- 
toimmune antibodies is indicative of the patient suffering from a disorder accor- 
ding to the invention. 

In addition or alternatively, a relevant diagnostic kit based upon immunohistoche- 
mical detection of at least one polypeptide according to the invention can be for- 
mulated. In such a kit, for example a purified antibody or antibodies specific for the 
polypeptide(s) according to the invention can be included as well as preferably the 
reagents necessary to detect the binding of the antibody(ies) to patient cells or tis- 
sue sections. These reagents include, for example a specific anti-species and subty- 
pe specific secondary antibody -directed against a polypeptide according to the 
invention of a functional variant thereof- preferably coupled to an enzyme capable 



- 104- 

of catalysis of e.g. a colorigenic substrate or coupled to a fluorophore (such as Tex- 
as Red, for example). Perferably the enzymatic substrate would also be included as 
well as washing and incubation buffers. An additional optional component of such 
a kit may be a section of positive control tissue, e.g. liver, or tissues or a section 
from a packed pellet of cells specifically expressing the polypeptide(s) as a positive 
tissue control. Instructions provided would include preferred and/or alternative 
methods of antigen retrieval for detection of the polypeptide(s) according to the 
invention or e.g., indication that frozen, rather than formalin fixed and paraffin- 
embedded tissue material should be employed. In this case, recommendations 
would preferably be included for fixation of frozen tissue sample sections, such as 
immersion in ice-cold acetone for 10 minutes. Further instructions would pre- 
ferably provide recommendations for the concentration of antibodies to use in the 
detection of the gene produces) as well as e.g., recommended and suggested incu- 
bation times and temperatures for exposure of the tissue to the immunological rea- 
gents provided. Preferred reaction buffers for the antibody incubations, such as 
0.01% - 0.1% tween-20 containing phosphate buffered saline including 3% normal 
sheep serum, could also be included. Further, specific conditions for washing of the 
tissue sections prior to and following incubation in the specific antibody would be 
preferably included, such as for example, 4 washes with 0.1% tween-20 containing 
phosphate buffered saline for 5 minutes each. Such immunohistochemical detection 
protocols are known to a person skilled in the art. In general the kit would pre- 
ferably include a panel of images of specific immunohistochemical staining results 
from positive and negative tissue examples and in particular tables indicating 
which result is indicative of the patient suffering from the discorder to be diagno- 
sed as a user guide. Utilization of such a kit would preferably rule out, support or 
confirm diagnoses of the aforementioned liver disorders, liver cancer, or epithelial 
cancers according to the invention. 

As specified above for nucleic acid-based diagnostic approaches, diagnostics based 
on detection and/or quantitation of polypeptides according to the invention may 
include 1 or more of such polypeptides. Moreover, simultaneous detection of such 
polypeptides together with other peptides implicated in the disorders according to 
the invention may be employed in such diagnostics. 
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1. Diagnostic containing the nucleic acids according to SEQ ID No. 10 to 
SEQ ID No. 19 or variants thereof, combined with suitable additives and/or 
auxiliaries. 

2. A diagnostic according to claim 1, wherein the nucleic acid is a probe, pref- 
erentially a DNA probe. 

3. The use of the nucleic acids according to SEQ ID No. 10 to SEQ ID No. 19 
or variants thereof for the production of a diagnostic for the diagnosis of a 
liver disorder and an epithelial cancer, wherein the nucleic acids according 
to SEQ ID No. 10 to SEQ ID No. 19 or variants thereof are combined with 
suitable additives and/or auxiliaries. 

4. The use of accoiding to claim 3, wherein the liver disorder is a disorder 
selected from cirrhosis, alcoholic liver disease, chronic hepatitis, Wilson's 
Disease, heamochromatosis, hepatocellular carcinoma, benign liver neo- 
plasms, and focal nodular hyperplasia. 

5. The use according to claim 3, wherein the epithelial cancer is an adenocar- 
cinoma of any organ other than liver, preferably of an organ selected from 
the group consisting of the lung, the stomach, the kidney, the colon, the 
prostate, the skin, and the breast. 
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The invention relates to polypeptides, nucleic acids encoding these or a nucleic acid 
according to SEQ ID No. 19 and to their use for the diagnosis, prevention and/or 
treatment of liver disorders and neoplastic disorders especially liver cancer and/or 
other epithelial cancers. In particular, the present invention relates to the use of 
these nucleic acids and polypeptides in connection with hepatocellular carcinoma 
(HCC), other epithelia-derived cancers, and non-malignant and non-cancerous liver 
disorders (e.g., adenoma, focal nodular hyperplasia, cirrhosis). 
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Figure 2: OBcll.pr (SEQ ID No. 1) 



1 MYSSSCETTRNTTGIEESTDGMILGPEDLS 
31 YQIYDVSGESNS A VSTEDLKECLKKQLEFC 
61 FSRENLSKDLYLISQMDSDQFIPI WTVANM 
91 EEIKKLTTDPDLILEVLRSSPMVQVDEKGE 
121 KVRPSHKRCIVILREIPETTPIEE VKGLFK 
151 SENCPKVISCEFAHNSNWYITFQSDTDAQQ 
181 AFKYLREEVKTFQGKPIMARIKAINTFFAK 
211 NGYRLMDSSI YSHPIQTQAQYASP VFMQPV 
241 YNPHQQYSVYSIVPQSWSPNPTPYFETPLA 
271 PFPNGSFVNGFNSPGSYKTNAAAMNMGRPF 
301 QKNRVKPQFRSSGGSEHSTEGSVSLGDGQL 
331 NRYSSRNFPAERHNPTVTGHQEQTYLQKET 
361 STLQVEQNGDYGRGRRTLFRGRRRREDDRI 
391 SRPHPSTAESKAPTPKFDLLASNFPPLPGS 
421 SSRMPGELVLENRMSDVVKGVYKEKDNEEL 
45 1 TISCPVPADEQTECTSAQQLNMSTSSPCAA 
481 ELTALSTTQQEKDLIEDSSVQKDGLNQTTI 
511 PVSPPSTTKPSRASTASPCNNNINAATAVA 
541 LQEPRKLSYAEVCQKPPKEPSSVLVQPLRE 
571 LRSNVVSPTKNEDNGAPENSVEKPHEKPEA 
601 RASKDYSGFRGNIIPRGAAGKIREQRRQFS 1 
631 HRAIPQGVTRRNGKEQYVPPRSPK 
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Figure 3: OBcI5.pr sequence (SEQ ID No. 2) 

MGVELMMELE PLQGNEETRA LFMPREDTAR PQSASQEESS 
REPDHAGTLI VDISTSRTVI QNAYVSLEET LK 
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Figure 4: IK2.pr sequence (SEQ ID No.3) 

1 MLPPRRLQTL LRQAVELQRD RCLYHNTKLD NNLDSVSLLI 
41 DHVCSRRQFP CYTQQILTEH CNEVWFCKFS NDGTKLATGS 
8 1 KDTTVIIWQV DPDTHLLKLL KTLEGHAYGV SYIAWSPDDN 
121 YLVACGPDDC SELWLWNVQT GELRTKMSQS HEDSLTSVAW 
161 NPDGKRFVTG GQRGQFYQCD LDGNLLDSWE GVRVQCLWCL 
201 SDGKTVLASD THQRIRGYNF EDLTDRNIVQ EDHPIMSFTI 
241 SKNGRLALLN VATQGVHLWD LQDRVLVRKY QGVTQGFYTI 
281 HSCFGGHNED FIASGSEDHK VYIWHKRSEL PIAELTGHTR 
321 TVNCVS WNPQ IPSMMASASD DGTVRIWGPA PFIDHQNIEE 
361 ECSSMDS 
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Figure 5: IK5.pr sequence (SEQ ID No.4) 

1 MDVNIAPLRA WDDFFPGSDR FARPDFRDIS KWNNRWSNL 
41 LYYQTNYLW AAMMISIVGF LSPFNMILGG IWVLVFTGF 
8 1 V WAAHNKDVL RRMKKRYPTT FVMVVMLASY FLISMFGGVM 
121 VFVFGITFPL LLMFIHASLR LRNLKNKLEN KMEGIGLKRT 
161 PMGIVLDALE QQEEGINRLT DYISKVKE 
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Figure 6: DAP3.pr sequence (SEQ ID No.5) 

1 MMLKGITRLI SRIHKLDPGR FLHMGTQARQ SIAAHLDNQV 
41 PVESPRAISR TNENDPAKHG DQHEGQHYNI SPQDLETVFP 
8 1 HGLPPRFVMQ VKTFSEACLM VRKPALELLH YLKNTSFAYP 
121 AIRYLLYGEK GTGKTLSLCH VIHFCAKQDW LILHIPDAHL 
161 WVKNCRDLLQ SSYNKQRFDQ PLEASTWLKN FKTTNERFLN 
201 QIKVQEKYVW NKRESTEKGS PLGEWEQGI TRVRNATDAV 
241 GIVLKELKRQ SSLGMFHLLV AVDGINALWG RTTLKREDKS 
281 PIAPEELALV H^RKMMKND WHGGAIVSAL SQTGSLFKPR 
321 KAYLPQELLG KEGFDALDPF IPILVSNYNP KEFESCIQYY 
361 LENNWLQHEK APTEEGKKEL LFLSNANPSL LERHCAYL 
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Figure 7: LOC5.pr sequence (SEQ ID No.6) 

1 MAGPELLLDS NIRLWWLPI VIITFFVGMI RHYVSILLQS 

41 DKKLTQEQVS DSQVLIRSRV LRENGKYIPK QSFLTRKYYF 

81 NNPEDGFFKK TKRKWPPSP MTDPTMLTDM MKGNVTNVLP 

1 2 1 MILIGGWINM TFSGFVTTKV PFPLTLRFKP MLQQGIELLT 

161 LDAS WVSSAS WYFLNVFGLR SIYSLILGQD NAADQSRMMQ 

201 EQMTGAAMAM PADTNKAFKT EWEALELTDH QWALDDVEEE 

241 LMAKDLHFEG MFKKELQTSI F 
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Figure 8: SEC14L2.pr sequence (SEQ ID No.7) 

1 MSGRVGDLSP RQKEALAKFR ENVQDVLPAL PNPDDYFLLR 
41 WLRARSFDLQ KSEAMLRKHV EFRKQKDIDN HSWQPPEVI 
81 QQYLSGGMCG YDLDGCPVWY DIIGPLDAKG LLFSASKQDL 
121 LRTKMRECEL LLQECAHQTT KLGRKVETIT IIYDCEGLGL 
161 KHLWKPAVEA YGEFLCMFEE NYPETLKRLF VVKAPKLFPV 
201 AYNLIKPFLS EDTRKKIMVL GANWKEVLLK HISPDQVPVE 
24 1 YGGTMTDPDG NPKCKSKINY GGDIPRKYYV RDQVKQQYEH 
281 SVQISRGSSH QVEYEILFPG CVLRWQFMSD GADVGFGIFL 
321 KTKMGERQRA GEMTEVLPNQ RYNSHLVPED GTLTCSDPGI 
361 YVLRFDNTYS FIHAKKVNFT VEVLLPDKAS EEKMKQLGAG 
401 TPK 
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Figure 9: SSP29.pr sequence (SEQ ID No.8) 

1 MDMKRRIHLE LRNRTPAAVR ELVLDNCKSN DGKIEGLTAE 
41 FVNLEFLSLI NVGLISVSNL PKLPKLKKLE LSENRIFGGL 
8 1 DMLAEKLPNL THLNLSGNKL KDISTLEPLK KLECLKSLDL 
121 FNCEVTNLND YRESVFKLLP QLTYLDGYDR EDQEAPDSDA 
161 EVDGVDEEEE DEEGEDEEDE DDEDGEEEEF DEEDDEDEDV 
201 EGDEDDDEVS EEEEEFGLDE EDEDEDEDEE EEEGGKGEKR 
241 KRETDDEGEDD 
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Figure 10: HS16.pr sequence (SEQ ID No.9) 

1 MPRGSRSRTS RMAPPASRAP QMRAAPRPAP VAQPPAAAPP 
41 SAVGSSAAAP RQPGLMAQMA TTAAGVAVGS AVGHTLGHAI 
81 TGGFSGGSNA EPARPDITYQ EPQGTQPAQQ QQPCLYEIKQ 
121 FLECAQNQGD IKLCEGFNEV LKQCRLANGL A 
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Figure 11: OBcll sequence (SEQ ID No.10) 

1 ccgggtggag gggcaaggcg agtgtgtgtc cttatcctag caattggggc gcgggcctgt 
61 gagccagttg gagttgcggc ggcgggaacg attgggctga gcagaggacg acatgttgct 
121 tttcgtggag catttatggg gttaagtggc atgggatttc tgtttctgat agtaaatagc 
181 aggtagcatc taaaggaact ggtttaaatc ctaatgccaa agtatggcaa gaaattgctc 
241 ctggaaatac tgatgccacc ccagtaactc atggaactga aagctcttgg catgaaatag 
301 cagctacatc aggtgctcat cctgagggta atgcagagct ctcagaagat atatgtaaag 
361 aatatgaagt aatgtattct tcatcttgtg aaaccacaag aaatactaca ggcattgaag 
421 aatcaactga tgggatgatt ttaggaccag aagatctgag ttaccaaata tatgatgttt 
481 ccggagaaag caattcagca gtttctacag aagacctaaa agaatgtctg aagaaacaat 
541 tagaattctg tttttcacga gaaaatttgt caaaggatct ttacttgata tctcaaatgg 
601 atagtgatca gttcatccca atttggacag ttgccaacat ggaagaaata aaaaagttga 
661 ctacagaccc tgatctaatt cttgaagtgt taagatcttc tcccatggta caagttgatg 
72 1 agaagggtga gaaagtgaga ccaagtcata agcgttgtat tgtaattctt agagagattc 
781 ctgaaacaac accaatagag gaagtgaaag gtttgttcaa aagtgaaaac tgccccaaag 
841 tgataagctg tgagtttgca cacaatagca actggtatat cactttccag tcagacacag 
901 atgcacaaca ggcttttaaa tacttaagag aagaagttaa aacatttcag ggcaagccaa 
961 ttatggcaag gataaaagcc atcaatacat tttttgctaa gaatggttat cgattaatgg 
1021 attctagtat ctatagtcac cccattcaaa ctcaagcaca gtatgcctcc ccagtcttta 
1081 tgcagcctgt atataatcct caccaacagt actcggtcta tagtattgtg cctcagtctt 
1 141 ggtctccaaa tcctacacct tactttgaaa caccactggc tccctttccc aatggtagtt 
1201 ttgtgaatgg ctttaattcg ccaggatctt ataaaacaaa tgctgctgct atgaatatgg 
1261 gtcgaccatt ccaaaaaaat cgtgtgaagc ctcagtttag gtcatctggt ggttcagaac 
1 32 1 actcaacaga gggctctgta tccttggggg atggacagtt gaacagatat agttcaagaa 
1381 actttccagc tgaacggcat aaccccacag taactgggca tcaggagcaa acttaccttc 
1441 agaaggagac ttccactttg caggtggaac agaatgggga ctatggtagg ggcaggagaa 
1501 ctctcttcag aggtcgaaga cgacgagaag atgacaggat ctcaagacct catccttcaa 
1 561 cagctgaatc aaaggctcca acaccaaagt ttgacttatt agcctcaaat tttccacctt 
1621 tacctggaag ttcatcaaga atgccaggtg aactcgtttt ggagaatagg atgtctgatg 
1681 ttgttaaagg tgtctacaaa gaaaaggata atgaagagtt gacaattagt tgcccagtgc 
1741 ctgcagatga gcagacagaa tgcacttctg cccagcaact caatatgagt accagttctc 
1 801 catgtgctgc tgagcttact gcattaagca caactcagca agaaaaggat ctaatagaag 
1 861 attcctctgt tcagaaggat ggtctcaatc agacaactat accagtttct cctccaagta 



t 
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192 
198 
204 
210 
216 
222 
228 
234 
240 
246 
252 
258 
264 
270 
276 
282 
288 
294 
300 
306 
312 
318 
324 
330 
336 
342 
348 
354 
360 
366 
372 
378 
384 
390 



ctacaaagcc atcgagggca agtactgctt caccatgtaa taataacata aatgcagcta 
cagctgtggc tctacaggaa ccccgaaagt taagttatgc tgaagtgtgc cagaagcccc 
ctaaagagcc atcttcagtt cttgtgcagc cactacggga acttcgctcc aatgtggtgt 
ctcccaccaa aaatgaagac aatggagctc ctgagaactc cgttgagaaa ccacatgaga 
agccagaagc aagggctagt aaggattatt ctggcttccg aggcaatata atccccaggg 
gagcagcagg aaaaatcagg gaacagagac gccagtttag ccatagggct atacctcagg 
gagtgactcg acgtaatggc aaagagcaat atgtgccacc cagatcacca aagtaaaaaa 
caacaaaact attcaaaaac ttcactctct tcccattaaa cttgaactgt ggctatattg 
aactgttttg gaggggaggg ggtagccagg aaggaaacaa gagaaagtac gtccatttca 
ttatggattt tggagttgtg agtgatagga tcccaaaatt catctctaat gtggttttta 
aatgctggag gattccaatc aatataaata tatatatata tatacacaca catatataaa 
aagtataatt tttctatttt tgtttttggt tttaatttgc agagatttgc tgccaggaat 
caattttgag ggttcagatt tagcttggaa gaaaaaaaag aaacatacat ccttcagtat 
aggagatgag ggaatgagag aaaatatttt ttgaagaagc atttctgtaa aattagaaat 
tacttttttt aatctattta aagtttggct tgaagaatgc catctctgac tatatggcct 
tgtattgcaa agcagatcag tggctggggt gcctgttgtg ggtgtgagtg tgtacaagag 
cgattgaagc caaatctgtt gtcatgttag taaatgattt gaaaactgaa tgtaatactt 
gagtagattt ttttttctag tttgaaattt agtctgtctt tttgacctta ctaatatttc 
atttaacaag ttgtaaaact ctgattgtac ttagagatgt gactaccaat cagtttgata 
ctcaaggaaa gggggttatt caagaaattg aaaatttcat cttggacctc agtgcatcgg 
tcaaatggat ttcagaggtt taaacttccc tgtgattccc cctgaatacc cccaaaatga 
gaaacaaaat tttttttctt actccatttg ttactctctg ttctttgact gcccacccac 
agaaaagcaa aataaccaac tacctactca attgtgtgtt tgtaattgct ttgagcagtc 
tagtcaaatc atataaattg ttctaaattt cagaattgaa cattgaagta ttaactcttc 
tgttcacaca tttagaattt tagctcccaa gatggtaggg cagactgacc gtacagtaat 
ttatttgtcg ttagtgttaa agattaagca tagtaactga ctcttaagtg ttaaataatg 
tagaagtaaa aaaatttttt ttaaaggctt aatttgggag gggggactta tttctgttta 
cagtgtatta ccttccttcc ctcctcttct ccccccacac ccaacaaaat acagtttgga 
attcactgaa acagtaccag caagtcatga gattttttag taaagatgag aaagatggtt 
gaagaaaatt agtgcataat ttctcagtga ataaagttgt agctctcata tactaaatag 
acaagtttac atgctgttat ttagaaaatg actaaaatat taaaaaccgt gttgtgttaa 
tctgttttaa gtcataccat gttcagagtt ctatgtaagg tgggttttat ttttctttta 
agggatagtt tgtaatagta agaactgtcc catatgttag taaattacat atgtacaaat 
tgaaactgta aattgtgaac actggaaagc accattgtga catagagtaa acatcttagt 
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II aatatattaa agtgaatgta aatggtggtt aaaattacat tactgtgaaa ttcatcttcc 
\l aactctaagt taagctttgg agatacatgt tagtggttaa ctgttaagag ctttgaaaac 
!1 actgcacata tctgtacaag ccagaattac tatttctttg acttattatt agcttggcag 
H ttgcttttga tttgattgtt ttatgacatg gtatactact atatttactc agtttgaaac 
11 tattcatttc tacacactat ttttaaaaat tgcctactag gtgaaacata acaataaaac 
►1 tacctgtgct gaaatttggg ggaagtttag gtcctttaaa aaaacatatt aatcattgac 
:l tacatctatg ataaaagtgc ttattttggt ttactaagat aatgcagttg gtggaaatga 
!1 taaacgtttt aagtgttaac atcctttgaa tgcgttggat ttcagagaat aaacattttg ' 
H taaaaatcac ttggtaagga ttataaactt aattactgca cttaaaatga aacattactt 
il tttttaaaca atgtgtcaca aatgtaggtc tgtattactt gtatgcttgt gtgacttact 
il gttagtccag ctctaaaaat ttaaaggttg taattgaaat acaagaaaag agccttcttt 
11 tagaagaaag caagtatatt tttgctttta cttcaaatgt tatttaaagt agaaatttaa 
H tttgtagata taacctttaa aaattttctc attaagacaa tgtttttaat ttaatttgcc 
U tcattacatc taatagttcc catttgatgg catgtatagg gaagagtgag agagtgtgtg 
U tgtgtgtatg tgtgtgtaat atttatatat attcacagta tgtatttagc atttatttta 
»1 ttacagcaga tttaaagttt gtatctaaat aatgcctatg agttgtgtga agctcttggc 
11 tttttttcca acgttacttt gtaactaatg agggtggatg ttcattgtag tttatttatt 
>1 tggttcttta gatggaggaa tttaaaaaat caaatttttc tcttcacctt tatgacttga 
LI catttccttg atctgttgga ggctaaaagt aggtataaat gatattgaat gttgggtata 
►1 gtgatactct gccatagttc ttactgcatg aagagaacaa gagtcacaca agttcaccac 
>1 tttgcacttc atagagaagg tacatagaga cattgcaaaa cctgtctcca tttgctatcc 
11 tgataattaa ggttttcata atacctaggg cctgtctctg agtaatttta attttgccaa 
11 atacactgac atttaaaata gtgatccatc taaatttttt tcagctgggt tttgaggaat 
H ataagagctt tcaatgataa aggtttgttg tagttgtctt atgtgctgaa tttgcagatg 
II atcagatgct gtgcagaatt ctgatttatt tttgtttcct aaaattaaga tagcttgaat 
il attatttcac attccttttt cttttttaaa taaacaggtt tgctttggaa aggcttaatg 
tl atggaatgtt agcatcttca ctagggtaaa gaagaacaaa aagaatgttg ctggaacgta 
!1 aaatagtatt taaaagttaa tgaacacttc tctagttttc ttagttatgg ccttaataat 
H tagtctcttg gcttaaatgt ccactggttt tactttgaca cagttgaaca acactggggt 
>1 taagtctctg gtatttaggc tggcaatata tatattaacc atattttaaa agtaccaatt 
il ttgtttttac agaaaagata aaactcaaaa gagaacagtg tattccttct gaggggcttt 
II tataaattat taactataat atatgatgga ttttttccta attttttata tttccttaca 
!1 attttggtgg ccattaattt aactttaggc ttttgggcat atgctagtct gagcttccga 
H aaagatacat atatgtttcc cttttcatta gctgaatgag gatattttaa gaagttgaaa 
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6001 gagaatttat tttcaagttg tgagtaaatc ctcctttgaa attcacctga ttattagata 
6061 acttaaagtt tatttttaaa agctgacaac tttttatgaa tcttcgagtt gacagttcct 
6121 aaaagcgtaa ctcagatatt aatgggctgt gtattaaatg gttttatttt cagttttgca 
6181 gcacagaaca ctgttgaaat atccatatca acttgatttt tttaacctaa ttcaggtgtc 
6241 ctttgcatct cttaaatgtt gggggtgggg gtcagagcca gttatccggc ttctgttttg 
6301 tcgattgctt agatttgttc ctgttgtcaa aactgttacc cccaaaattg gtgtgacaca 
6361 tgctcatgca taaaatgtta aaatgagtac atccttgtat ttgtatttgt tttcaacatc 
6421 gccaaggtgc tatgggaaat taacaaaatt agaaaaaaaa taaaattatt aaaaagcaaa 
6481 aaaaaaaaaa aaaaaaa 
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Figure 12: OBcl5 sequence (SEQ ID No.ll) 

1 atgggggtgg aactcatgat ggaattggag cctttacaag ggaatgaaga gacaagagct 
6 1 ctctttatgc cacgtgagga tacagcaagg ccccaatctg caagccagga agagtcgtca 
121 cgagaaccag accatgcagg aactctgatc gtggacattt caacctccag aactgtgatc 
181 caaaatgcat atgtatcttt ggaagaaact ctgaagtaaa ggccggaata ttctttgttt 
241 aaaacattaa aaacaaaaca gaccaaagca tcaagcaaga agtttcctgg caataaacta 
301 agcacagcat tattttttaa ggaacacaaa ttaagtgttc aacctgtggc aaatttgtac 
361 tttctccctg aattatgttg ttatcaaaga aaaaaattgg gaagcatggc aaaatatcat 
421 caaaactgaa actagaatta aactaaatta aaataaaaaa aaaaaaaaaa aaaaaaaaaa 
481 aaaa 
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Figure 13: IK2 sequence (SEQ ID No.12) 

1 ctacgtgcaa aagcagaatg ggaaggctaa gggacagctt cccgatctaa actattggat 
6 1 aaacttcaga cctatttacc accatcagtg atgcttcccc cacggcgttt acagactctc 
121 ctgcggcagg cggtggaact acaaagggat cggtgcctat atcacaatac caaacttgat 
181 aataatctag attctgtgtc tctgcttata gaccatgttt gtagtaggag gcagttccca 
24 1 tgttatacgc agcagatact tacggagcat tgtaatgaag tgtggttctg taaattctct 
301 aatgatggca ctaaactagc aacaggatca aaagatacaa cagttatcat atggcaagtt 
361 gatccggata cacacctgct aaaactgctt aaaacattag aaggacatgc ttatggcgtt 
421 tcttatattg catggagtcc agatgacaac tatcttgttg cttgtggccc agatgactgc 
48 1 tctgagcttt ggctttggaa tgtacaaaca ggagaactaa ggacaaaaat gagccagtct 
541 catgaagaca gtttgacaag tgtggcttgg aatccagatg ggaagcgctt tgtgactgga 
60 1 ggtcagcgtg ggcagttcta tcagtgtgac ttagatggta atctccttga ctcctgggaa 
66 1 ggggtaagag tgcaatgcct ttggtgcttg agtgatggaa agactgttct ggcatcagat 
72 1 acacaccagc gaattcgggg ctataacttc gaggacctta cagataggaa catagtacaa 
. 78 1 gaagatcatc ctattatgtc ttttactatt tcaaaaaatg gccgattagc tttgttaaat 
84 1 gtagcaactc agggagttca tttatgggac ttgcaagaca gagttttagt aagaaagtat 
90 1 caaggtgtta cacaagggtt ttatacaatt cattcatgtt ttggaggcca taatgaagac 
96 1 ttcatcgcta gtggcagtga agatcacaag gtttacatct ggcacaaacg tagtgaactg 
1 02 1 ccaattgcgg agctgacagg gcacacacgt acagtaaact gtgtgagctg gaacccacag 
1081 attccatcca tgatggccag cgcctcagat gatggcactg ttagaatatg gggaccagca 
1141 ccttttatag accaccagaa tattgaagag gaatgcagta gcatggatag ttgatggtga 
1201 atttggagca gacgacctct gtttaactta aaattagtcg tattttaatg gcttgggatt 
1261 tggtgcaaac aaacatgatt gatagctgga cagacatgct cgtcatgaaa aaagaaccat 
1321 ttctgaagcc cgattggggc caaacattta caccttgctt catagtaacc agttgagatg 
1381 aagcacgtcg ttagaacgtt gttggacacc atgttgaatt attcccccat cggttgtgaa 
1441 gaactgtgct acattcaggc ttacccattg aactcagtat atatattttt ttccttcctg 
1501 tcttttgtct ggcaggatac cattcttgtt gctcttctgt gtaatgaagt ttaaatgctt 
1561 gtttggaaaa ctttatttaa cagtttagaa ggcttgatag aaagagtgca ttagtctgaa 
1 62 1 gagtatacat tggataggaa agaatttcct tcttttgttt ctccaaatct ttccgcctta 
1681 tttagcttga gatctttgca gcttggttca tggattctag ccttgcccgt tgcgcagtat 
1741 atactgatcc agatgataaa ccagtgaact atgtcaaaag cactctcaat attacatttg 
1801 acaaaaagtt ttgtactttt cacatagctt gttgccccgt aaaagggtta acagcacaat 
1 861 tttttaaaaa taaattaaga agtatttaaa aaaaaaaaaa aaaa 
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Figure 14: EK5 sequence (SEQ ID No.13) 

1 cgctgtcaac tctccaactc agctcagctg atcggttgcc gccgccgccg ccgccagatt 

6 1 ctggaggcga agaacgcaaa gctgagaaca tggacgttaa tatcgcccca ctccgcgcct 

1 2 1 gggacgattt cttcccgggt tccgatcgct ttgcccggcc ggacttcagg gacatttcca 

1 8 1 aatggaacaa ccgcgtagtg agcaacctgc tctattacca gaccaactac ctggtggtgg 

241 ctgccatgat gatttccatt gtggggtttc tgagtccctt caacatgatc ctgggaggaa 

301 tcgtggtggt gctggtgttc acagggtttg tgtgggcagc ccacaataaa gacgtccttc 

361 gccggatgaa gaagcgctac cccacgacgt tcgttatggt ggtcatgttg gcgagctatt 

42 1 tccttatctc catgtttgga ggagtcatgg tctttgtgtt tggcattact tttcctttgc 

48 1 tgttgatgtt tatccatgca tcgttgagac ttcggaacct caagaacaaa ctggagaata 

54 1 aaatggaagg aataggtttg aagaggacac cgatgggcat tgtcctggat gccctagaac 

601 agcaggaaga aggcatcaac agactcactg actatatcag caaagtgaag gaataaacat 

66 1 aacttacctg agctagggtt gcagcagaaa ttgagttgca gcttgccctt gtccagacct 

721 atgttctgct tgcgtttttg aaacaggagg tgcacgtacc acccaattat ctatggcagc 

78 1 atgcatgtat aggccgaact attatcagct ctgatgtttc agagagaaga cctcagaaac 

84 1 cgaaagaaaa ccaccaccct cctattgtgt ctgaagtttc acgtgtgttt atgaaatcta 

90 1 atgggaaatg gatcacacga tttctttaag ggaattaaaa aaaataaaag aattacggct 

96 1 tttacagcaa caatacgatt atcttatagg aaaaaaaaaa atcattgtaa agtatcaaga 

102 1 caatacgagt aaatgaaaag gctgttaaag tagatgacat catgtgttag cctgttccta 

1 08 1 aatccctaga attgtaatgt gtgggatata aattagtttt tattattctc ttaaaaatca 

1141 aagatgatct ctatcacttt gccacctgtt tgatgtgcag tggaaactgg ttaagccagt 

1 20 1 tgttcatact tcctttacaa atataaagat agctgtttag gatattttgt tacatttttg 

1261 taaatttttg aaatgctagt aatgtgtttt caccagcaag tatttgttgc aaacttaatg 

1321 tcattttcct taagatggtt acagctatgt aacctgtatt attctggacg gacttattaa 

1381 aatacaaaca gacaaaaaat aaaacaaaac ttgagttcta tttaccttgc acattttttg 

1441 ttgttacagt gaaaaaaatg gtccaagaaa atgtttgcca tttttgcatt gtttcgtttt " 

1501 taactggaac atttagaaag aaggaaatga atgtgcattt tattaattcc ttaggggcac 

1561 aaggaggaca ataatagctg atcttttgaa atttgaaaaa cgtctttaga tgaccaagca 

1621 aaaagacttt aaaaaatggt aatgaaaatg gaatgcagct actgcagcta ataaaaaatt 

1681 ttagatagca attgttacaa ccatatgcct ttatagctag acattagaat tatgatagca 

1741 tgagtttata cattctatta tttttcctcc ctttctcatg tttttataaa taggtaataa 

1 801 aaaatgtttt gcctgccaat tgaatgattt cgtagctgaa gtagaaacat ttaggtttct 

1861 gtagcattaa attgtgaaga caactggagt ggtacttact gaagaaactc tctgtatgtc 
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1 92 1 ctagaataag aagcaatgat gtgctgcttc tgatttttct tgcattttaa attctcagcc 
1 98 1 aacctacagc catgatcttt agcacagtga tatcaccatg acttcacaga catggtctag 
2041 aatctgtacc cttacccaca tatgaagaat aaaattgatt aaaggtta 
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Figure 15: DAP3 sequence (SEQ ID No.14) 

1 gccttttttg cagtctcagg acgggcgctt tggagccggc cccaggcagc gtgtgtcggt 
61 cgcctagtct ggagaactag tcctcgactc acggtgaggg aatggaccga cacgggtatt 
121 gtaccgctga gggaaaggag cgggactccg gacctccagg agtgcaagga tgatgctgaa 
181 aggaataaca aggcttatct ctaggatcca taagttggac cctgggcgtt ttttacacat 
241 ggggacccag gctcgccaaa gcattgctgc tcacctagat aaccaggttc cagttgagag 
301 tccgagagct atttcccgca ccaatgagaa tgacccggcc aagcatgggg atcagcacga 
361 gggtcagcac tacaacatct ccccccagga tttggagact gtatttcccc atggccttcc 
421 tcctcgcttt gtgatgcagg tgaagacatt cagtgaagct tgcctgatgg taaggaaacc 
481 agccctagaa cttctgcatt acctgaaaaa caccagtttt gcttatccag ctatacgata 
541 tcttctgtat ggagagaagg gaacaggaaa aaccctaagt ctttgccatg ttattcattt 
601 ctgtgcaaaa caggactggc tgatactaca tattccagat gctcatcttt gggtgaaaaa 
661 ttgtcgggat cttctgcagt ccagctacaa caaacagcgc tttgatcaac ctttagaggc 
721 ttcaacctgg ctgaagaatt tcaaaactac aaatgagcgc ttcctgaacc agataaaagt 
781 tcaagagaag tatgtctgga ataagagaga aagcactgag aaagggagtc ctctgggaga 
84 1 agtggttgaa cagggcataa cacgggtgag gaacgccaca gatgcagttg gaattgtgct 
90 1 gaaagagcta aagaggcaaa gttctttggg tatgtttcac ctcctagtgg ccgtggatgg 
961 aatcaatgct ctttggggaa gaaccactct gaaaagagaa gataaaagcc cgattgcccc 
102 1 cgaggaatta gcacttgttc acaacttgag gaaaatgatg aaaaatgatt ggcatggagg 
1081 cgccattgtgtcggctttgagccagactgggtctctctttaagccccggaaagcctatct 
1 141 gccccaggag ttgctgggaa aggaaggatt tgatgccctg gatcccttta ttcccatcct 
1201 ggtttccaac tataacccaa aggaatttga aagttgtatt cagtattatt tggaaaacaa 
1261 ttggcttcaa catgagaaag ctcctacaga agaagggaaa aaagagctgc tgttcctaag 
1321 taacgcgaac ccctcgctgc tggagcggca ctgtgcctac ctctaagcca agatcacagc 
1381 atgtgaggaa gacagtggac atctgcttta tgctggaccc agtaagatga ggaagtcggg 
1441 cagtacacag gaagaggagc caggcccttg tacctatggg attggacagg actgcagttg 
1501 gctctggacc tgcattaaaatgggtttcac tgtgaatgcg tgacaataag atattccctt 
1561 gttcctaaaa ctttatatca gtttattgga tgtggttttt cacatttaag ataattatgg 
1621 ctcttttcct aaaaaataaa atatctttct 
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Figure 16: LOC5 sequence (SEQ ID No.lS) 

1 actggaagac caggcagccc agctgaaggc agtaagctcg gctcacagtc gcaggagagt 

6 1 tctggggtac acgggcaaag gggcttgaga aggcccggag gcgaagccga agagaagcaa 

1 2 1 ctgtgccccg gagaagagaa gctcgcccat tccagactgg gaaccagctt tcagtgaaga 

181 tggcagggcc agaactgttg ctcgactcca acatccgcct ctgggtggtc ctacccatcg 

241 ttatcatcac tttcttcgta ggcatgatcc gccactacgt gtccatcctg ctgcagagcg 

301 acaagaagct cacccaggaa caagtatctg acagtcaagt cctaattcga agcagagtcc 

361 tcagggaaaa tggaaaatac attcccaaac agtctttctt gacacgaaaa tattatttca 

42 1 acaacccaga ggatggattt ttcaaaaaaa ctaaacggaa ggtagtgcca ccttctccta 

48 1 tgactgatcc tactatgttg acagacatga tgaaagggaa tgtaacaaat gtcctcccta 

541 tgattcttat tggtggatgg atcaacatga cattctcagg ctttgtcaca accaaggtcc 

60 1 catttccact gaccctccgt tttaagccta tgttacagca aggaatcgag ctactcacat 

661 tagatgcatc ctgggtgagt tctgcatcct ggtacttcct caatgtattt gggcttcgga 

721 gcatttactc tctgattctg ggccaagata atgccgctga ccaatcacga atgatgcagg 

781 agcagatgac gggagcagcc atggccatgc ccgcagacac aaacaaagct ttcaagacag 

841 agtgggaagc tttggagctg acggatcacc agtgggcact agatgatgtc gaagaagagc 

901 tcatggccaa agacctccac ttcgaaggca tgttcaaaaa ggaattacag acctctattt 

961 tttgaagacc gagcagggat tagctgtgtc aggaacttgg agttgcactt aaccttgtaa 

1 02 1 ctttgtttgg agctggcacc tcttgaaata aaaaggagga tgcacgagct ggcaggcatg 

1081 caaaaaaaaa aaaaaaaaaa aaaaaaaaa 
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Figure 17: SEC14L2 sequence (SEQ ID No.16) 

1 ccctactccg cctctcggga tcctttaaga ggcggggctt ggctgccagc tccgcggccc 
61 gggcaaaagg ctgggacttt actccgggtg gcggcgagga cgagtctgtg ctccatcagc 
1 2 1 tgccgcaccc gccgcctccc gcccccaaac cccatccccg cggttgagcc acgatgagcg 
1 8 1 gcagagtcgg cgatctgagc cccaggcaga aggaggcatt ggccaagttt cgggagaatg 
24 1 tccaggatgt gctgccggcc ctgccgaatc cagatgacta ttttctcctg cgttggctcc 
301 gagccagaag cttcgacctg cagaagtcgg aggccatgct ccggaagcat gtggagttcc 
361 gaaagcaaaa ggacattgac aacatcatta gctggcagcc tccagaggtg atccaacagt 
421 atctgtcagg gggtatgtgt ggctatgacc tggatggctg cccagtctgg tacgacataa 
48 1 ttggacctct ggatgccaag ggtctgctgt tctcagcctc caaacaggac ctgctgagga 
541 ccaagatgcg ggagtgtgag ctgcttctgc aagagtgtgc ccaccagacc acaaagttgg 
601 ggaggaaggt ggagaccatc accataattt atgactgcga ggggcttggc ctcaagcatc 
66 1 tctggaagcc tgctgtggag gcctatggag agtttctctg catgtttgag gaaaattatc 
72 1 ccgaaacact gaagcgtctt tttgttgtta aagcccccaa actgtttcct gtggcctata 
78 1 acctcatcaa acccttcctg agtgaggaca ctcgtaagaa gatcatggtc ctgggagcaa 
841 attggaagga ggttttactg aaacatatca gccctgacca ggtgcctgtg gagtatgggg 
90 1 gcaccatgac tgaccctgat ggaaacccca agtgcaaatc caagatcaac tacgggggtg 
961 acatccccag gaagtattat gtgcgagacc aggtgaaaca gcagtatgaa cacagcgtgc 
1021 agatttcccgtggctcctcccaccaagtggagtatgagatcctcttccctggctgtgtcc 
1081 tcaggtggca gtttatgtca gatggagcgg atgttggttt tgggattttc ctgaagacca 
1 141 agatgggaga gaggcagcgg gcaggggaga tgacagaggt gctgcccaac cagaggtaca 
1201 actcccacct ggtccctgaa gatgggaccc tcacctgcag tgatcctggc atctatgtcc 
1261 tgcggtttga caacacctac agcttcattc atgccaagaa ggtcaatttc actgtggagg 
1321 tcctgcttcc agacaaagcc tcagaagaga agatgaaaca gctgggggca ggcaccccga 
1381 aataacacct tctcctatag caggcctggc cccctcagtg tctccctgtc aatttctacc 
1441 ccttgtagca gtcattttcg cacaaccctg aagcccaaag aaactgggct ggaggacaga 
1501 cctcaggagc tttcatttca gttaggcaga ggaagagcga ctgcagtggg tctccgtgtc 
1 56 1 tatcaaatac ctaaggagtc cccaggagct ggctggccat cgtgatagga tctgtctgtc 
1621 ctgtaaactg tgccaacttc acctgtccag ggacagcgaa gctgggggtg gcggggggca 
1 68 1 tgtaccacag ggtggcagca gggaaaaaaa ttagaaaagg gtgaaagatt gggacttaac 
1 741 acttcaggga agtcagctgc cggggagaaa cttgctccta aatgaacaca taagtttaga 
1 80 1 tcgcaatgag gagtagcagg gtagctggtt gctagagtta cggtggggat cagaaactct 
1 86 1 tccaaacatt ttagcactga ggctggggta gcttttggct tttcccaggt ctcaggaggt 
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1921 ggcctgagtc agcacacatc ttcccactcg gtagacaggc tggcctctcc ctcactttga 

198 1 gactttggca actcctgggc cacacggcct gcctctttga ttactaatga ttgtcagtga 

2041 ctcagagctt cctgggactt cgggtaccca cccgctgttc tccatgcaaa caaagcgcca 

2101 gggaaatgac ccacagggat cgcagctgca gggagggcca gggaggttgg gggtgggagt 

2161 gaatgctaaa agcagatcgt ccagtgccct tttcagtgct accggcctct caccaagcag 

2221 tcctccatgt gagcaacccc gagacaaaaa tgctaagtgg gatcaagaga gcagcactcg 

2281 gagagggtgt ttgccagtct gagtgtcccg cggtgcccgc caacccgctt cctgactgac 

2341 ctgagcaagg tcttactaag cagtcccatc tctgtgggag gcatgcaacg cgtgcaggga 

2401 gttcaggtgc cggtcggcgt agccaggcct ggaggccccc caggcaggag gccgcccaaa 

2461 ggcggggccg gcgtctcgca gactaggggc tgggggcggc cacagacggc ctcgaaacca 

2521 cagcccttac cccaatccca cgagccccgc caacgaacca caggtgctgg gctttagaga 

2581 acatgggaag gcggccccag acctggcggg aacgcctttc cctcagagcc aggccccggc 

2641 cccgtctggg aagctcatct tgcgaagctg agggagctca gggcaaaggc caggctagcg 

2701 cggaccggaa ggggccgagg ctgcacgggc ctctgccaga acgctcagga catcccggcc 

2761 tgggtttaca acgctgttag gaaaattaac caatgaataa agcaacgttc agtgcgca 
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Figure 18: SSP29 sequence (SEQ ID No.17) 

1 gtcgacgcgg ccgcgctccg ctcccgtgag taacttggct ccgggggctc cgctcgcctg 

6 1 cccgcacgcc gcccgccacc caggaccgcg ccgccggcct ccgccgctag caaacccttc 

1 2 1 cgacggccct cgctgcgcaa gccgggacgc ctctcccccc tccgcccccg ccgcggaaag 

1 8 1 ttaagtttga agagggggga agaggggaac atggacatga agaggaggat ccacctggag 

24 1 ctgaggaacc ggaccccggc agctgttcga gaacttgtct tggacaattg caaatcaaat 

301 gatggaaaaa ttgagggctt aacagctgaa tttgtgaact tagagttcct cagtttaata 

361 aatgtaggct tgatctcagt ttcaaatctc cccaagctgc ctaaattgaa aaagcttgaa 

42 1 ctcagtgaaa atagaatctt tggaggtctg gacatgttag ctgaaaaact tccaaatctc 

48 1 acacatctaa acttaagtgg aaataaactg aaagatatca gcaccttgga acctttgaaa 

54 1 aagttagaat gtctgaaaag cctggacctc tttaactgtg aggttaccaa cctgaatgac 

60 1 taccgagaga gtgtcttcaa gctcctgccc cagcttacct acttggatgg ctatgaccga 

661 gaggaccagg aagcacctga ctcagatgcc gaggtggatg gtgtggatga agaggaggag 

72 1 gacgaagaag gagaagatga ggaagacgag gacgatgagg atggtgaaga agaggagttt 

78 1 gatgaagaag atgatgaaga tgaagatgta gaaggggatg aggacgacga tgaagtcagt 

841 gaggaggaag aagaatttgg acttgatgaa gaagatgaag atgaggatga ggatgaagag 

90 1 gaggaagaag gtgggaaagg tgaaaagagg aagagagaaa cagatgatga aggagaagat 

96 1 gattaagacc ccagatgacc tgcagaaaca gaactgttca gtattggttg gactgctcat 

1 02 1 ggattttgta gctgtttaaa aaaaaaaaaa aggtagctgt gatacaaacc ccaggacacc ' 

1081 cacccaccca aagagccaaa gaatagttcc tgtgacattc cgccttcctt ccatgtagtc 

1 141 cctcttggta atctaccacc aagcttgtgg acttcacccc aacaaaattg taagcgttgt 

1201 taggtttttg tgtaagattc ttgctgtagc gtggatagct gtgattggtg agtcaaccgt 

1261 ctgtggctac cagttacact gagattgtaa cagcattttt actttctgta caacaaaaaa 

1 32 1 gctttgtaaa taaaatctta acattttggg tctgtttttt catgctttgc tttttaatta 

1381 ttattattat tttttttaca ttaggacatt ttatgtgaca actgccaaaa aagtattttt 

1441 aagaatttaagcgaaataaa cagttactct ttggc 
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Figure 19: HS16 sequence (SEQ ID No.18) 

1 gcaaccactg cagctgggcc aagtcgctta gctcttcggt ggttgtcaca cgtccggagg 

6 1 cctagccgtc gcgtacctag gatgccgcgt ggaagccgaa gccgcacctc ccgcatggcc 

121 cctccggcca gccgggcccc tcagatgaga gctgcaccca ggccagcacc agtcgctcag 

! 8 I ccaccagcag cggcaccccc atctgcagtt ggctcttctg ctgctgcgcc ccggcagcca 

24 1 ggtctgatgg cccagatggc aaccactgca gctggcgtgg ctgtgggctc tgctgtgggg 

301 cacacattgg gtcacgccat tactgggggc ttcagtggag gaagtaatgc tgagcctgcg 

3 6 1 aggcctgaca tcacttacca ggagcctcag ggaacccagc cggcacagca gcagcagcct 

42 i tgcctctatg agatcaaaca gtttctggag tgtgcccaga accagggtga catcaagctc 

481 tgtgagggtt tcaatgaggt gctgaaacag tgccgacttg caaacggatt ggcctaatga 

541 agaagttcaa cctggagaga tggaaaatca gctctcataa ctaagttaat ttagtataaa 

601 aatagaattg atagtgaggg tataaagtgt aaccatcagt taaacctctc ctgtcattcc 

661 tagcttcctt gcttcagaat tgaaatggaa gtgggggtgt ccctactctg tagaatctgg 

72 1 gactgggcaa atgtttgtgt ggcctcctta aactagctgt tatgttatga ttttattctt 

781 tgtgagttaa ttagaataaa gtcattttct tacaaaaaaa aaaaaaaaaa aaaaaaaaaa 

841 a 
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Figure 20: IK3 sequence (SEQ ID No.19) 

1 gggctcgtca gatatattaa ttttacactt cagttttgat tggtgagaaa gtacccattc 

6 1 tcttcaaata atcaaagata attattattt tgttttgttt ttggaatcaa cagggaggcg 

121 caaagtataa agttgctgct aacatatata catatacatc catattttat aagggtgtct 

181 atgtatatat agacagtgtg tccacacaaa aaatagatac agttatcagt cagtcagttc 

24 1 ttccatgatt tagttttttt aaacgtagaa aagctattgt aaacgtctct ttccatttat 

301 tcttaatttt ttgacatatt ggtatttctt taaagggaaa tgaggaatgc acatcagtga 

361 ttgattgtca aacctcaccc cctgatttcc tacctaatct acccccacct aaccaatcaa 

42 1 tcacatccac aaattgtttt gtttgtttgt tagtcaggct tccaacagag ttcaatattt 

481 ctaacactct agtgcaataa aaattattat taaatagcta agaggtgtgc atgtgggaaa 

541 ggtcagtgca tatcccttta ggaggggaga atgttgtaat atatcagcta tcgagttgtt 

601 taaaaaaagt gtattcaatc gtatattgtc tatagtatgt gctatgaaat ttgcatttat 

661 gatatgtaac aggggcaaag ccaaattcat gttactctgt tcagtcagaa acattttgtg 

72 1 gcatacagca ttcctgggaa gtgctgtact ttgtttcgtt ttggttttag ttttgcattt 

78 1 agagtgcctt ataattgatg cctattttaa tagcatttct ttttagcttt tggttcgtat 

841 ttccattcac tgttcgtatc tgttactttc tattaaagca ttatctgttt accacatgta 

901 caaaaactct ttgaataata tgcattccta gttttcagcc aagacgggga tgttagtgat 

96 1 tgtaccagcc caaagcactt ggataatcag ggcccttctt ccttttataa tcaatcatca 

1021 acatcagaaa aagctacttg ttttatttat attcccttcc aaatccgctc tggaacatgc 

1081 agtaactgca ccaaacttat tttagtaaca aatatcattg gcaactttgg aatatatttg 

1141 atattccatt aggatttttc taaaagggga aataaactat atatatatat gtatcttacc 

1 201 cccaattctt ccaacagaat ttctatagga agccatggat gatggcataa gtttgccaca 

1 26 1 tattacatga ttttaaataa tcctcaaaat acccaaggaa ctcttaaaga gttttggtat 

1321 gagtatacta ctttggttta attttagctt catggatgtt ctgcatggaa ggatttttgt 

1381 tttccacatt ttcccattgc tagcagagtg aaatccaaga gaccaaacat ttgcaagcat 

1441 tgtatttgag cacttttgta aaaaacaaaa aaaaaaaaaa aaaaaa 
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Figure21: 
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Figure 22: 
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Figure 24: 
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Figure 27: 
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